Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning

Areej ALSHUTAYRI; Huda ALAMOUDI; Boushra ALSHEHRI; Eman ALDHAHRI; Iqbal ALSALEH; Nahla ALJOJO; Abdullah ALGHOSON

doi:10.33436/v32i4y202201

Revista Română de Informatică și Automatică (Dec 2022)

Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning

Areej ALSHUTAYRI,
Huda ALAMOUDI,
Boushra ALSHEHRI,
Eman ALDHAHRI,
Iqbal ALSALEH,
Nahla ALJOJO,
Abdullah ALGHOSON

Affiliations

Areej ALSHUTAYRI: College of Computer Science and Engineering, Department of Computer Science and Artificial Intelligence, University of Jeddah, Jeddah, Saudi Arabia
Huda ALAMOUDI: College of Computer Science and Engineering, Department of Computer Science and Artificial Intelligence, University of Jeddah, Jeddah, Saudi Arabia
Boushra ALSHEHRI: College of Computer Science and Engineering, Department of Computer Science and Artificial Intelligence, University of Jeddah, Jeddah, Saudi Arabia
Eman ALDHAHRI: College of Computer Science and Engineering, Department of Computer Science and Artificial Intelligence, University of Jeddah, Jeddah, Saudi Arabia
Iqbal ALSALEH: Faculty of Economic and Administration, Management Information Systems Department, King AbdulAziz University, Jeddah, Saudi Arabia
Nahla ALJOJO: College of Computer Science and Engineering, Information Systems and Technology Department, University of Jeddah, Jeddah, Saudi Arabia
Abdullah ALGHOSON: College of Computer Science and Engineering, Information Systems and Technology Department, University of Jeddah, Jeddah, Saudi Arabia

DOI: https://doi.org/10.33436/v32i4y202201
Journal volume & issue: Vol. 32, no. 4
pp. 7 – 18

Abstract

Read online

Sentiment analysis is concerned with determining whether a certain material contains online information which expresses positive or negative sentiments. The tools for performing this analysis should be able to identify and assess thoughts and feelings with a reasonable degree of accuracy on feelings that are made openly available by people. It is expected that sentiment analysis would be performed for social media. That is why this paper investigates online social media, as sentiment analysis has become an important subject, and it is one of the approaches employed in the field of natural language processing. Sentiment analysis was applied for an Arabic Twitter dataset in order to identify the feelings expressed by the textual tweets and determine whether they were positive, negative, or neutral. Bigrams and unigrams were used when employing the multinomial Naïve Bayes, Gaussian Naïve Bayes, Logistic Regression, and Support Vector Machines (SVM) machine learning algorithms. The Logistic Regression algorithm achieved the highest accuracy, that is with 63.40%. The Long Short-Term Memory (LSTM) neural network was used for the deep learning-based classification, and it reached an accuracy rate of 70%, a figure which proved to be higher than the results shown in the related works.

Published in Revista Română de Informatică și Automatică

ISSN: 1220-1758 (Print); 1841-4303 (Online)
Publisher: ICI Publishing House
Country of publisher: Romania
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Automation; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://rria.ici.ro/?lang=en

About the journal

Abstract

Keywords