IEEE Access (Jan 2021)

Deep Sentiment Analysis: A Case Study on Stemmed Turkish Twitter Data

  • Harisu Abdullahi Shehu,
  • Md. Haidar Sharif,
  • Md. Haris Uddin Sharif,
  • Ripon Datta,
  • Sezai Tokat,
  • Sahin Uyaver,
  • Huseyin Kusetogullari,
  • Rabie A. Ramadan

DOI
https://doi.org/10.1109/ACCESS.2021.3071393
Journal volume & issue
Vol. 9
pp. 56836 – 56854

Abstract

Read online

Sentiment analysis using stemmed Twitter data from various languages is an emerging research topic. In this paper, we address three data augmentation techniques namely Shift, Shuffle, and Hybrid to increase the size of the training data; and then we use three key types of deep learning (DL) models namely recurrent neural network (RNN), convolution neural network (CNN), and hierarchical attention network (HAN) to classify the stemmed Turkish Twitter data for sentiment analysis. The performance of these DL models has been compared with the existing traditional machine learning (TML) models. The performance of TML models has been affected negatively by the stemmed data, but the performance of DL models has been improved greatly with the utilization of the augmentation techniques. Based on the simulation, experimental, and statistical results analysis deeming identical datasets, it has been concluded that the TML models outperform the DL models with respect to both training-time (TTM) and runtime (RTM) complexities of the algorithms; but the DL models outperform the TML models with respect to the most important performance factors as well as the average performance rankings.

Keywords