Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques

Minjung Park; Sangmi Chai

doi:10.1109/ACCESS.2023.3294613

IEEE Access (Jan 2023)

Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques

Minjung Park,
Sangmi Chai

Affiliations

Minjung Park: Ewha School of Business, Ewha Womans University, Seodaemun-gu, Seoul, South Korea
Sangmi Chai: ORCiD; Ewha School of Business, Ewha Womans University, Seodaemun-gu, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3294613
Journal volume & issue: Vol. 11
pp. 71517 – 71527

Abstract

Read online

As fake news spreads rapidly in social media, attempts to develop detection technology to automatically identify fake news are actively being developed, recently. However, most of them focus only on the linguistic and compositional characteristics of fake news (e.g., source or authors indication, length of a message, frequency of negative words). Compared to them, this study proposes a fake news detection model based on machine learning that reflects the characteristics of users, news content, and social networks based on social capital. To comprehensively reflect the characteristics related to the spread of fake news, this study applied the XGBoost model to estimate the feature importance of each variable to derive the priority factors that preferentially affect fake news detection. Based on the derived variables, we established SVM, RF, LR, CART, and NNET, which are representative classification models of machine learning, and compared the performance rate of fake news detection. To generalize the established models (i.e., to avoid overfitting or underfitting), this study performed a cross-validation step, and to compare the predictive accuracy of the established models. As a result, the RF model indicated the highest prediction rate at about 94%, while the NNET had the lowest performance rate at about 92.1%. The results of this study are expected to contribute to improve the fake news detection system in preparation for the more sophisticated generation and spread of fake news.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords