A challenge of authorship identification for ten-thousand-scale microblog users
Publication Type | Conference Paper |
Year of Publication | 2014 |
Authors | Okuno, S., Asai, H., Yamana, H. |
Conference Name | 2014 IEEE International Conference on Big Data (Big Data) |
Date Published | Oct |
Keywords | authorship attribution, authorship detection, authorship identification, Big Data, Blogs, Computers, Distance measurement, Internet, Internet security issues, microblog, microblog texts, security, security of data, social networking (online), ten-thousand-scale microblog users, Training, Twitter |
Abstract | Internet security issues require authorship identification for all kinds of internet content; however, authorship identification for microblog users is much harder than for other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., at big data scale, identification takes a long time. Our proposed method addresses both problems. The experimental results show that our method identifies authorship with a precision of 53.2% among 10,000 microblog users, in almost half the execution time of the previous method. |
DOI | 10.1109/BigData.2014.7004491 |
Citation Key | 7004491 |