Revista Română de Informatică și Automatică (Dec 2022)

Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning

  • Areej ALSHUTAYRI,
  • Huda ALAMOUDI,
  • Boushra ALSHEHRI,
  • Eman ALDHAHRI,
  • Iqbal ALSALEH,
  • Nahla ALJOJO,
  • Abdullah ALGHOSON

DOI
https://doi.org/10.33436/v32i4y202201
Journal volume & issue
Vol. 32, no. 4
pp. 7 – 18

Abstract

Read online

Sentiment analysis is concerned with determining whether a certain material contains online information which expresses positive or negative sentiments. The tools for performing this analysis should be able to identify and assess thoughts and feelings with a reasonable degree of accuracy on feelings that are made openly available by people. It is expected that sentiment analysis would be performed for social media. That is why this paper investigates online social media, as sentiment analysis has become an important subject, and it is one of the approaches employed in the field of natural language processing. Sentiment analysis was applied for an Arabic Twitter dataset in order to identify the feelings expressed by the textual tweets and determine whether they were positive, negative, or neutral. Bigrams and unigrams were used when employing the multinomial Naïve Bayes, Gaussian Naïve Bayes, Logistic Regression, and Support Vector Machines (SVM) machine learning algorithms. The Logistic Regression algorithm achieved the highest accuracy, that is with 63.40%. The Long Short-Term Memory (LSTM) neural network was used for the deep learning-based classification, and it reached an accuracy rate of 70%, a figure which proved to be higher than the results shown in the related works.

Keywords