Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator
Title | Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Traylor, Terry, Straub, Jeremy, Gurmeet, Snell, Nicholas |
Conference Name | 2019 IEEE 13th International Conference on Semantic Computing (ICSC) |
Date Published | Feb. 2019 |
Publisher | IEEE |
ISBN Number | 978-1-5386-6783-5 |
Keywords | attribution, Attribution Classification, Bayes methods, Bayesian machine learning system, classifier performance, component, composability, deceptive content, decision making, fake news, fake news articles, fake news classifier, fake news detector, fake news identification study, fake news stories, feature extraction, Grammar, Human Behavior, in-article attribution, Influence Mining, information accuracy, key linguistic characteristics, learning (artificial intelligence), Linguistics, machine learning, mainstream media platforms, Media, Metrics, natural language processing, news article, pattern classification, pubcrawl, radio news, resultant process precision, social media conduits, social media platforms, social networking (online), supervised learning estimator, Tools |
Abstract | Intentionally deceptive content presented under the guise of legitimate journalism is a worldwide information accuracy and integrity problem that affects opinion forming, decision making, and voting patterns. Most so-called `fake news' is initially distributed over social media conduits like Facebook and Twitter and later finds its way onto mainstream media platforms such as traditional television and radio news. The fake news stories that are initially seeded over social media platforms share key linguistic characteristics such as making excessive use of unsubstantiated hyperbole and non-attributed quoted content. In this paper, the results of a fake news identification study that documents the performance of a fake news classifier are presented. The Textblob, Natural Language, and SciPy Toolkits were used to develop a novel fake news detector that uses quoted attribution in a Bayesian machine learning system as a key feature to estimate the likelihood that a news article is fake. The resultant process precision is 63.333% effective at assessing the likelihood that an article with quotes is fake. This process is called influence mining and this novel technique is presented as a method that can be used to enable fake news and even propaganda detection. In this paper, the research process, technical analysis, technical linguistics work, and classifier performance and results are presented. The paper concludes with a discussion of how the current system will evolve into an influence mining system. |
URL | https://ieeexplore.ieee.org/document/8665593 |
DOI | 10.1109/ICOSC.2019.8665593 |
Citation Key | traylor_classifying_2019 |
- news article
- information accuracy
- key linguistic characteristics
- learning (artificial intelligence)
- Linguistics
- machine learning
- mainstream media platforms
- Media
- Metrics
- natural language processing
- Influence Mining
- pattern classification
- pubcrawl
- radio news
- resultant process precision
- social media conduits
- social media platforms
- social networking (online)
- supervised learning estimator
- tools
- fake news articles
- Attribution Classification
- Bayes methods
- Bayesian machine learning system
- classifier performance
- component
- composability
- deceptive content
- Decision Making
- fake news
- attribution
- fake news classifier
- fake news detector
- fake news identification study
- fake news stories
- feature extraction
- Grammar
- Human behavior
- in-article attribution