PRETO: A high-performance text mining tool for preprocessing Turkish texts

dc.contributor.author: Tunali, V.
dc.contributor.author: Bilgin, T.T.
dc.date.accessioned: 2024-07-12T21:40:38Z
dc.date.available: 2024-07-12T21:40:38Z
dc.date.issued: 2012 [en_US]
dc.department: [To be determined] [en_US]
dc.description: 13th International Conference on Computer Systems and Technologies, CompSysTech 2012 -- 22 June 2012 through 23 June 2012 -- Ruse -- 93756 [en_US]
dc.description.abstract: Text documents are usually unstructured and written in natural language. To apply conventional data mining techniques to text documents, a preprocessing operation is indispensable. In this paper, we introduce PRETO, a cross-platform, powerful and scalable tool developed specifically for preprocessing Turkish texts, with a wide range of preprocessing options such as stemming, stopword filtering, statistical term filtering, and n-gram generation. We demonstrate the performance and scalability of PRETO with experiments on large document collections. Copyright ©2012 ACM. [en_US]
dc.identifier.doi: 10.1145/2383276.2383297
dc.identifier.endpage: 140 [en_US]
dc.identifier.isbn: 9.78145E+12
dc.identifier.scopus: 2-s2.0-84869002711 [en_US]
dc.identifier.scopusquality: N/A [en_US]
dc.identifier.startpage: 134 [en_US]
dc.identifier.uri: https://doi.org/10.1145/2383276.2383297
dc.identifier.uri: https://hdl.handle.net/20.500.12415/7411
dc.indekslendigikaynak: Scopus
dc.language.iso: en [en_US]
dc.relation.ispartof: ACM International Conference Proceeding Series [en_US]
dc.relation.publicationcategory: Conference Item - International - Institutional Faculty Member [en_US]
dc.rights: info:eu-repo/semantics/closedAccess [en_US]
dc.snmz: KY08754
dc.subject: Data Mining [en_US]
dc.subject: Natural Language Processing [en_US]
dc.subject: Text Mining [en_US]
dc.subject: Text Preprocessing [en_US]
dc.title: PRETO: A high-performance text mining tool for preprocessing Turkish texts [en_US]
dc.type: Conference Object
dspace.entity.type: Publication
