Sentiment Analysis Using Hybrid Model of Stacked Auto-Encoder-Based Feature Extraction and Long Short Term Memory-Based Classification Approach

Iqra Kanwal; Fazli Wahid; Sikandar Ali; Ateeq-Ur Rehman; Ahmed Alkhayyat; Akram Al-Radaei

doi:10.1109/ACCESS.2023.3313189

IEEE Access (Jan 2023)

Sentiment Analysis Using Hybrid Model of Stacked Auto-Encoder-Based Feature Extraction and Long Short Term Memory-Based Classification Approach

Iqra Kanwal,
Fazli Wahid,
Sikandar Ali,
Ateeq-Ur Rehman,
Ahmed Alkhayyat,
Akram Al-Radaei

Affiliations

Iqra Kanwal: ORCiD; Department of Information Technology, The University of Haripur, Haripur, Pakistan
Fazli Wahid: Department of Information Technology, The University of Haripur, Haripur, Pakistan
Sikandar Ali: ORCiD; Department of Information Technology, The University of Haripur, Haripur, Pakistan
Ateeq-Ur Rehman: ORCiD; Department of Information Technology, The University of Haripur, Haripur, Pakistan
Ahmed Alkhayyat: ORCiD; College of Technical Engineering, The Islamic University, Najaf, Iraq
Akram Al-Radaei: Information Technology Department, Thamar University, Thamar, Yemen

DOI: https://doi.org/10.1109/ACCESS.2023.3313189
Journal volume & issue: Vol. 11
pp. 124181 – 124197

Abstract

Read online

Customer reviews about a brand or product, movie reviews, and social media reviews can be analyzed through sentiment analysis. Sentiment analysis is used to identify the emotional tone of language to comprehend the attitudes, opinions, and feelings represented in online reviews. As for large data, it is a task that can take a lot of time and can be automated as the machine learns through the training and testing of data. Previously, various standard machine learning and deep learning models namely Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), Long Short Term Memory (LSTM), Naïve Bayes (NB), Support Vector Machine (SVM), Gated Recurrent Unit (GRU) have been used. The key issue in our research is that when text is provided to LSTM directly, it cannot adequately extract informative features from the text, leading to less accurate findings. The softmax layer of Stacked Auto-encoder when used directly to categorize the extracted features, is power-constrained and unable to do so accurately. A hybrid of the Stacked Auto-encoder (SAE) and LSTM models was proposed. SAE is used for the extraction of relevant informative features. LSTM was used for further classification of sentiments based on the extracted features. The proposed model is evaluated on an IMDB dataset by splitting it into five different training testing ratios using the following performance evaluation metrics: confusion matrix, classification accuracy, precision, recall, sensitivity, specificity, and F1 score. The hybrid results performed best at a ratio of 90/10 and classified sentiments with an accuracy of 87%. The accuracy of proposed hybrid model is better than that of standard models namely RNN, CNN, LSTM, NB, SVM, and GRU.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords