IEEE Access (Jan 2022)

Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling

  • Naelah O. Bahurmuz,
  • Ghada A. Amoudi,
  • Fatmah A. Baothman,
  • Amani T. Jamal,
  • Hanan S. Alghamdi,
  • Areej M. Alhothali

DOI
https://doi.org/10.1109/ACCESS.2022.3217522
Journal volume & issue
Vol. 10
pp. 114907 – 114918

Abstract

In today’s world, news outlets have changed dramatically; newspapers are obsolete, and radio is no longer in the picture. People look for news online and on social media platforms such as Twitter and Facebook. Social media contributors share information and trending stories before verifying their truthfulness, thus spreading rumors. Early identification of rumors on social media has attracted many researchers. However, relatively few studies have focused on languages such as Arabic. In this study, an Arabic rumor detection model is proposed. The model was built using a transformer-based deep learning architecture. According to the literature, transformers are neural networks with outstanding performance in natural language processing tasks. Two transformer-based models, AraBERT and MARBERT, were employed, tested, and evaluated using three recently developed Arabic datasets. These models are extensions of BERT (Bidirectional Encoder Representations from Transformers), a deep learning model that uses the transformer architecture to learn text representations and leverages the attention mechanism. We also mitigated the challenges introduced by the imbalanced training datasets by employing two sampling techniques. In our experiments, the proposed approaches achieved a maximum accuracy of 0.97. This result demonstrates the effectiveness of the proposed method, which outperformed other existing Arabic rumor detection methods.
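
The abstract does not name the two sampling techniques that were used. Purely as an illustration of how class imbalance is commonly handled before fine-tuning a classifier, the sketch below shows random oversampling and random undersampling in plain Python. The function names, the labels "rumor" and "non-rumor", and the toy data are assumptions made for this example and are not taken from the paper.

```python
import random
from collections import Counter


def _group_by_label(texts, labels):
    """Collect texts into a dict keyed by class label."""
    groups = {}
    for text, label in zip(texts, labels):
        groups.setdefault(label, []).append(text)
    return groups


def random_oversample(texts, labels, seed=0):
    """Hypothetical helper: duplicate randomly chosen examples of the
    smaller classes until every class is as large as the largest one."""
    rng = random.Random(seed)
    groups = _group_by_label(texts, labels)
    target = max(len(items) for items in groups.values())
    pairs = []
    for label, items in groups.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        pairs.extend((text, label) for text in items + extra)
    rng.shuffle(pairs)
    return [t for t, _ in pairs], [l for _, l in pairs]


def random_undersample(texts, labels, seed=0):
    """Hypothetical helper: keep a random subset of the larger classes
    so that every class is as small as the smallest one."""
    rng = random.Random(seed)
    groups = _group_by_label(texts, labels)
    target = min(len(items) for items in groups.values())
    pairs = []
    for label, items in groups.items():
        pairs.extend((text, label) for text in rng.sample(items, target))
    rng.shuffle(pairs)
    return [t for t, _ in pairs], [l for _, l in pairs]


# Toy, made-up data: one "rumor" example against four "non-rumor" examples.
texts = ["t1", "t2", "t3", "t4", "t5"]
labels = ["rumor", "non-rumor", "non-rumor", "non-rumor", "non-rumor"]

_, over_labels = random_oversample(texts, labels)
_, under_labels = random_undersample(texts, labels)
print(Counter(over_labels), Counter(under_labels))
```

Resampling of this kind is normally applied to the training split only, so that the evaluation data keeps its original class distribution.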

Keywords