An Improved Sentiment Classification Approach for Measuring User Satisfaction toward Governmental Services’ Mobile Apps Using Machine Learning Methods with Feature Engineering and SMOTE Technique

Mohammed Hadwan; Mohammed Al-Sarem; Faisal Saeed; Mohammed A. Al-Hagery

doi:10.3390/app12115547

Applied Sciences (May 2022)

An Improved Sentiment Classification Approach for Measuring User Satisfaction toward Governmental Services’ Mobile Apps Using Machine Learning Methods with Feature Engineering and SMOTE Technique

Mohammed Hadwan,
Mohammed Al-Sarem,
Faisal Saeed,
Mohammed A. Al-Hagery

Affiliations

Mohammed Hadwan: Department of Information Technology, College of Computer, Qassim University, Buraydah 51452, Saudi Arabia
Mohammed Al-Sarem: College of Computer Science and Engineering, Taibah University, Medina 42353, Saudi Arabia
Faisal Saeed: College of Computer Science and Engineering, Taibah University, Medina 42353, Saudi Arabia
Mohammed A. Al-Hagery: Department of Computer Science, College of Computer, Qassim University, Buraydah 51452, Saudi Arabia

DOI: https://doi.org/10.3390/app12115547
Journal volume & issue: Vol. 12, no. 11
p. 5547

Abstract

Read online

Analyzing the sentiment of Arabic texts is still a big research challenge due to the special characteristics and complexity of the Arabic language. Few studies have been conducted on Arabic sentiment analysis (ASA) compared to English or other Latin languages. In addition, most of the existing studies on ASA analyzed datasets collected from Twitter. However, little attention was given to the huge amounts of reviews for governmental or commercial mobile applications on Google Play or the App Store. For instance, the government of Saudi Arabia developed several mobile applications in healthcare, education, and other sectors as a response to the COVID-19 pandemic. To address this gap, this paper aims to analyze the users’ opinions of six applications in the healthcare sector. An improved sentiment classification approach was proposed for measuring user satisfaction toward governmental services’ mobile apps using machine learning models with different preprocessing methods. The Arb-AppsReview dataset was collected from the reviews of these six mobile applications available on Google Play and the App Store, which includes 51k reviews. Then, several feature engineering approaches were applied, which include Bing Liu lexicon, AFINN, and MPQA Subjectivity Lexicon, bag of words (BoW), term frequency-inverse document frequency (TF-IDF), and the Google pre-trained Word2Vec. Additionally, the SMOTE technique was applied as a balancing technique on this dataset. Then, five ML models were applied to classify the sentiment opinions. The experimental results showed that the highest accuracy score (94.38%) was obtained by applying a support vector machine (SVM) using the SMOTE technique with all concatenated features.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords