Ensemble based high performance deep learning models for fake news detection

Mohammed E. Almandouh; Mohammed F. Alrahmawy; Mohamed Eisa; Mohamed Elhoseny; A. S. Tolba

doi:10.1038/s41598-024-76286-0

Scientific Reports (Nov 2024)

Affiliations

Mohammed E. Almandouh: Portsaid University
Mohammed F. Alrahmawy: Mansoura University
Mohamed Eisa: Portsaid University
Mohamed Elhoseny: Mansoura University
A. S. Tolba: Mansoura University

DOI: https://doi.org/10.1038/s41598-024-76286-0
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 24

Abstract

Social media has emerged as a dominant platform where individuals freely share opinions and communicate globally. Because it is easily accessible, it plays a significant role in disseminating news worldwide. However, the growing use of these platforms carries a severe risk of misleading people. Our research investigates machine learning, deep learning, and ensemble learning techniques for Arabic fake news detection. We integrated FastText word embeddings with various machine learning and deep learning methods. We then leveraged advanced transformer-based models, including BERT, XLNet, and RoBERTa, optimizing their performance through careful hyperparameter tuning. The methodology uses two Arabic news article datasets, AFND and ARABICFAKETWEETS, each categorized into fake and real subsets, and applies comprehensive preprocessing to the text data. Four hybrid deep learning models are presented: CNN-LSTM, RNN-CNN, RNN-LSTM, and Bi-GRU-Bi-LSTM. The Bi-GRU-Bi-LSTM model demonstrated superior performance in terms of F1 score, accuracy, and loss. Its precision, recall, F1 score, and accuracy are 0.97, 0.97, 0.98, and 0.98 on the AFND dataset, and 0.98, 0.98, 0.99, and 0.99 on the ARABICFAKETWEETS dataset, respectively. The study's primary conclusion is that the Bi-GRU-Bi-LSTM model outperforms the other models by a significant margin when detecting fake news in Arabic. The work aids the global fight against false information by setting the stage for future research to extend fake news detection to multiple languages.
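The abstract reports precision, recall, F1 score, and accuracy for a binary fake/real classifier. As a reading aid, the sketch below shows how these four metrics are conventionally computed from predicted and true labels. It is not the authors' evaluation code: the function name `binary_metrics`, the label encoding (1 = fake, 0 = real), and the toy label lists are all assumptions made for illustration.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> Dict[str, float]:
    """Compute precision, recall, F1, and accuracy for binary labels.

    Assumption: `positive` marks the class of interest (here, fake news).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0

    return {"precision": precision, "recall": recall,
            "f1": f1, "accuracy": accuracy}


# Hypothetical toy labels (1 = fake, 0 = real); not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

With multi-class or imbalanced data, the averaging scheme (macro, micro, or weighted) changes the reported values; the abstract does not state which scheme was used.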

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
