dc.contributor.author | Ağun, Hayri Volkan | |
dc.contributor.author | Yılmazel, Sibel | |
dc.contributor.author | Yılmazel, Özgür | |
dc.contributor.editor | Nie, JY | |
dc.contributor.editor | Obradovic, Z | |
dc.contributor.editor | Suzumura, T | |
dc.date.accessioned | 2019-10-18T18:30:10Z | |
dc.date.available | 2019-10-18T18:30:10Z | |
dc.date.issued | 2017 | |
dc.identifier.isbn | 978-1-5386-2715-0 | |
dc.identifier.issn | 2639-1589 | |
dc.identifier.uri | https://hdl.handle.net/11421/10157 | |
dc.description | IEEE International Conference on Big Data (IEEE Big Data) -- DEC 11-14, 2017 -- Boston, MA | en_US |
dc.description | WOS: 000428073701109 | en_US |
dc.description.abstract | In this study we present a text categorization approach for stylometric analysis of Turkish documents. To this end we extend traditional features and take advantage of linguistic processing. We experiment with a combination of bag of stems and additional features such as function words, part of speech tags and morpho-syntactic tags in datasets having varying number of authors. Based on the characteristics of Turkish (agglutinative language) we expected morpho-syntactic tags to perform better. However, neither part of speech tags nor morpho syntactic tags has showed a significant gain in our settings. Our findings suggest that the main performance is dominated with bag of stem features and the best performance is achieved with combination of bag of stems and function words. | en_US |
dc.description.sponsorship | IEEE, IEEE Comp Soc, ELSEVIER, CISCO | en_US |
dc.language.iso | eng | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartofseries | IEEE International Conference on Big Data | |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Natural Language Processing | en_US |
dc.subject | Feature Extraction | en_US |
dc.subject | Text Classification | en_US |
dc.subject | Authorship Attribution | en_US |
dc.title | Effects of Language Processing in Turkish Authorship Attribution | en_US |
dc.type | conferenceObject | en_US |
dc.relation.journal | 2017 IEEE International Conference On Big Data (Big Data) | en_US |
dc.contributor.department | Anadolu Üniversitesi | en_US |
dc.contributor.authorID | Agun, Hayri Volkan/0000-0002-4253-8920 | en_US |
dc.identifier.startpage | 1876 | en_US |
dc.identifier.endpage | 1881 | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.contributor.institutionauthor | Yılmazel, Özgür | |