Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling

Naelah O. Bahurmuz; Ghada A. Amoudi; Fatmah A. Baothman; Amani T. Jamal; Hanan S. Alghamdi; Areej M. Alhothali

doi:10.1109/ACCESS.2022.3217522

IEEE Access (Jan 2022)

Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling

Naelah O. Bahurmuz,
Ghada A. Amoudi,
Fatmah A. Baothman,
Amani T. Jamal,
Hanan S. Alghamdi,
Areej M. Alhothali

Affiliations

Naelah O. Bahurmuz: Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Ghada A. Amoudi: ORCiD; Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Fatmah A. Baothman: ORCiD; Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Amani T. Jamal: Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Hanan S. Alghamdi: ORCiD; Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Areej M. Alhothali: ORCiD; Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia

DOI: https://doi.org/10.1109/ACCESS.2022.3217522
Journal volume & issue: Vol. 10
pp. 114907 – 114918

Abstract

Read online

In today’s world, news outlets have changed dramatically; newspapers are obsolete, and radio is no longer in the picture. People look for news online and on social media, such as Twitter and Facebook. Social media contributors share information and trending stories before verifying their truthfulness, thus, spreading rumors. Early identification of rumors from social media has attracted many researchers. However, a relatively smaller number of studies focused on other languages, such as Arabic. In this study, an Arabic rumor detection model is proposed. The model was built using transformer-based deep learning architecture. According to the literature, transformers are neural networks with outstanding performance in natural language processing tasks. Two transformers-based models, AraBERT and MARBERT, were employed, tested, and evaluated using three recently developed Arabic datasets. These models are extensions to the BERT, Bidirectional Encoder Representations from Transformers, a deep learning model that uses transformer architecture to learn the text representations and leverages the attention mechanism. We have also mitigated the challenges introduced by the imbalanced training datasets by employing two sampling techniques. The experimental results of our proposed approaches achieved a maximum accuracy of 0.97. This result demonstrated the effectiveness of the proposed method and outperformed other existing Arabic rumor detection methods.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords