Ensemble Classifiers for Arabic Sentiment Analysis of Social Network (Twitter Data) towards COVID-19-Related Conspiracy Theories

Abdullah Al-Hashedi; Belal Al-Fuhaidi; Abdulqader M. Mohsen; Yousef Ali; Hasan Ali Gamal Al-Kaf; Wedad Al-Sorori; Naseebah Maqtary

doi:10.1155/2022/6614730

Applied Computational Intelligence and Soft Computing (Jan 2022)

Affiliations

Abdullah Al-Hashedi: Faculty of Computing and IT
Belal Al-Fuhaidi: Faculty of Computing and IT
Abdulqader M. Mohsen: Faculty of Computing and IT
Yousef Ali: Faculty of Computing and IT
Hasan Ali Gamal Al-Kaf: Faculty of Computing and IT
Wedad Al-Sorori: Faculty of Computing and IT
Naseebah Maqtary: Faculty of Computing and IT

DOI: https://doi.org/10.1155/2022/6614730
Journal volume & issue: Vol. 2022

Abstract

Sentiment analysis has become increasingly important with the massive growth of online content. It involves analyzing textual data generated on social media, which can be easily accessed, collected, and analyzed. Since the emergence of COVID-19, most published studies on COVID-19-related conspiracy theories have been surveys of people's sentiments and opinions and of the pandemic's impact on their lives. Only a few studies have applied machine learning-based sentiment analysis to social media. Those studies focused mainly on English-language tweets and paid little attention to other languages such as Arabic. This study proposes a machine learning model for analyzing Arabic tweets. In this model, Word2Vec word embeddings serve as the main source of features. Two pretrained continuous bag-of-words (CBOW) models are investigated, and Naïve Bayes is used as the baseline classifier. Several single and ensemble machine learning classifiers are evaluated with and without SMOTE (synthetic minority oversampling technique). The experimental results show that combining word embeddings with an ensemble classifier and SMOTE improves the average F1 score over the baseline classifier and over the other classifiers (single and ensemble) without SMOTE.
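
As a rough illustration of the oversampling step named in the abstract, the sketch below shows the core idea of SMOTE in plain Python: each synthetic minority sample is placed at a random point on the line segment between a minority sample and one of its nearest minority neighbours. This is a minimal sketch and not the authors' implementation. The function name `smote`, its parameters, and the toy 2-D vectors are assumptions made for illustration; in the study the feature vectors would come from pretrained Word2Vec (CBOW) embeddings, and a library implementation would normally be used instead.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    Hypothetical helper for illustration only, not a library API.
    """
    rng = random.Random(seed)
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        # Pick one of the k nearest neighbours at random.
        neighbour = minority[rng.choice(others[:k])]
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy 2-D stand-ins for tweet embeddings (assumed values, not real data).
majority = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [0.3, 0.2], [0.0, 0.4], [0.4, 0.1]]
minority = [[1.0, 1.0], [1.2, 0.9], [0.9, 1.3]]

# Generate enough synthetic minority points to match the majority class size.
new_points = smote(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))  # prints "6 6"
```

Oversampling of this kind is normally applied to the training split only, so that synthetic points do not leak into the data used to compute the F1 score.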

Published in Applied Computational Intelligence and Soft Computing

ISSN: 1687-9724 (Print); 1687-9732 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://onlinelibrary.wiley.com/journal/4795
