Examining the impact of stemming on clustering Turkish texts
dc.authorid | 0000-0002-2735-7996 | en_US |
dc.contributor.author | Tunali V. | |
dc.contributor.author | Bilgin T.T. | |
dc.date.accessioned | 2024-07-12T22:02:12Z | |
dc.date.available | 2024-07-12T22:02:12Z | |
dc.date.issued | 2012 | en_US |
dc.department | Maltepe Üniversitesi, Rektörlük | en_US |
dc.description | International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2012 -- 2 July 2012 through 4 July 2012 -- Trabzon -- 92831 | en_US |
dc.description.abstract | Preprocessing is an important step in information retrieval and text mining. In this study, we examined the impact of stemming on clustering Turkish texts. We used two datasets compiled from web sites of Turkish news agencies, and performed extensive experiments. We empirically show that there is no significant evidence that stemming always improves the quality of clustering for texts in Turkish. However, when stemming is used, dimensionality of the document-term matrix dramatically decreases without inversely affecting the clustering performance. As a result, it is highly recommended to apply stemming for clustering Turkish texts. © 2012 IEEE. | en_US |
dc.identifier.doi | 10.1109/INISTA.2012.6246966 | |
dc.identifier.isbn | 9.78147E+12 | |
dc.identifier.scopus | 2-s2.0-84866634611 | en_US |
dc.identifier.uri | https://dx.doi.org/10.1109/INISTA.2012.6246966 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12415/9114 | |
dc.indekslendigikaynak | Scopus | |
dc.language.iso | en | en_US |
dc.relation.ispartof | INISTA 2012 - International Symposium on INnovations in Intelligent SysTems and Applications | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.snmz | KY07461 | |
dc.subject | data mining | en_US |
dc.subject | document clustering | en_US |
dc.subject | preprocessing | en_US |
dc.subject | stemming | en_US |
dc.subject | text mining | en_US |
dc.title | Examining the impact of stemming on clustering Turkish texts | en_US |
dc.type | Conference Object | |
dspace.entity.type | Publication |