Visible to the public Authorship Attribution Analysis of Thai Online Messages

TitleAuthorship Attribution Analysis of Thai Online Messages
Publication TypeConference Paper
Year of Publication2014
AuthorsMarukatat, R., Somkiadcharoen, R., Nalintasnai, R., Aramboonpong, T.
Conference NameInformation Science and Applications (ICISA), 2014 International Conference on
Date PublishedMay
KeywordsAccuracy, author identification, authorship attribution analysis, C4.5 decision tree, Decision trees, Kernel, natural language processing, support vector machine, Support vector machines, SVM, Thai online messages, Training, Training data, Writing, writing attributes
Abstract

This paper presents a framework to identify the authors of Thai online messages. The identification is based on 53 writing attributes and the selected algorithms are support vector machine (SVM) and C4.5 decision tree. Experimental results indicate that the overall accuracies achieved by the SVM and the C4.5 were 79% and 75%, respectively. This difference was not statistically significant (at 95% confidence interval). As for the performance of identifying individual authors, in some cases the SVM was clearly better than the C4.5. But there were also other cases where both of them could not distinguish one author from another.

DOI10.1109/ICISA.2014.6847369
Citation Key6847369