Analysis and Classification of Fake News Using Sequential Pattern Mining

M. Zohaib Nawaz; M. Saqib Nawaz; Philippe Fournier-Viger; Yulin He

doi:10.26599/BDMA.2024.9020015

Big Data Mining and Analytics (Sep 2024)

Analysis and Classification of Fake News Using Sequential Pattern Mining

M. Zohaib Nawaz,
M. Saqib Nawaz,
Philippe Fournier-Viger,
Yulin He

Affiliations

M. Zohaib Nawaz: College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060
M. Saqib Nawaz: College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060
Philippe Fournier-Viger: College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060
Yulin He: Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen 518107, China

DOI: https://doi.org/10.26599/BDMA.2024.9020015
Journal volume & issue: Vol. 7, no. 3
pp. 942 – 963

Abstract

Read online

Disinformation, often known as fake news, is a major issue that has received a lot of attention lately. Many researchers have proposed effective means of detecting and addressing it. Current machine and deep learning based methodologies for classification/detection of fake news are content-based, network (propagation) based, or multimodal methods that combine both textual and visual information. We introduce here a framework, called FNACSPM, based on sequential pattern mining (SPM), for fake news analysis and classification. In this framework, six publicly available datasets, containing a diverse range of fake and real news, and their combination, are first transformed into a proper format. Then, algorithms for SPM are applied to the transformed datasets to extract frequent patterns (and rules) of words, phrases, or linguistic features. The obtained patterns capture distinctive characteristics associated with fake or real news content, providing valuable insights into the underlying structures and commonalities of misinformation. Subsequently, the discovered frequent patterns are used as features for fake news classification. This framework is evaluated with eight classifiers, and their performance is assessed with various metrics. Extensive experiments were performed and obtained results show that FNACSPM outperformed other state-of-the-art approaches for fake news classification, and that it expedites the classification task with high accuracy.

Published in Big Data Mining and Analytics

ISSN: 2096-0654 (Print)
Publisher: Tsinghua University Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8254253

About the journal

Abstract

Keywords