Misclassifications: The Missing Link
Title | Misclassifications: The Missing Link |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Thankaraj, A., Nair, A. J., Vasudevan, N., Pathari, V. |
Conference Name | 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) |
ISBN Number | 978-1-5090-6367-3 |
Keywords | author attribution problem, authors, book similarity, classification, domain experts, Electronic mail, feature extraction, Human Behavior, learning (artificial intelligence), Length measurement, literary style, Metrics, misclassifications, misclassified books, missing link, proximity, pubcrawl, recommendation system, recommender systems, statistical analysis, statistical methods, stylometric cues, stylometry, Support vector machines, Syntactics, text analysis, Training, Writing, writing style moulds |
Abstract | The notion of style is pivotal to literature. The choice of a certain writing style moulds and enhances the overall character of a book. Stylometry uses statistical methods to analyze literary style. This work aims to build a recommendation system based on the similarity in stylometric cues of various authors. The problem at hand is in close proximity to the author attribution problem. It follows a supervised approach with an initial corpus of books labelled with their respective authors as training set and generate recommendations based on the misclassified books. Results in book similarity are substantiated by domain experts. |
URL | https://ieeexplore.ieee.org/document/8126091/ |
DOI | 10.1109/ICACCI.2017.8126091 |
Citation Key | thankaraj_misclassifications:_2017 |
- missing link
- writing style moulds
- Writing
- Training
- text analysis
- Syntactics
- Support vector machines
- stylometry
- stylometric cues
- statistical methods
- statistical analysis
- recommender systems
- recommendation system
- pubcrawl
- proximity
- author attribution problem
- misclassified books
- misclassifications
- Metrics
- literary style
- Length measurement
- learning (artificial intelligence)
- Human behavior
- feature extraction
- Electronic mail
- domain experts
- classification
- book similarity
- authors