Geo-spatial Information Science (Oct 2024)
An attention-based hybrid model for spatial and temporal sentiment analysis of COVID-19 related tweets in the contiguous United States
Abstract
Understanding the sentiments of social media posts can help health authorities respond to disease outbreaks, through a proxy measure of fear, confidence, and community compliance. Sentiment analysis identifies a pattern of emotion through the written word and assigns a positive, neutral, or negative value to it. As of February 2023, there were 677.7 million confirmed cases of coronavirus disease (COVID-19) and more than 6.7 million confirmed deaths. In this paper, around 170,000 COVID-19-related tweets were collected between September 2020 and January 2021 in the contiguous United States. Data preprocessing and exploratory investigation were completed for analysis of the collected dataset. Further, a novel and unified architecture called attention-based one-dimensional convolution with bidirectional long short-term memory layers (CNN-BiLSTM-ATT) is proposed to classify people’s sentiments as positive, neutral, and negative based on COVID-19-related tweets. In the CNN-BiLSTM-ATT model, the CNN layer can extract the low-level semantic features from textual data, and the BiLSTM layer can extract both the previous and future contextual representations. The attention module can improve the information focus from the outputted layer of the BiLSTM. The proposed method can extract both the local phrase representations and the global feature of sentences. Numerical experiments were conducted on COVID-19-related tweets using the proposed method and other baseline models to compare their performances. Our experimental results demonstrate that the CNN-BiLSTM-ATT model achieves an average accuracy of 95.16% and a macro-average F1-score of 95.12%, which outperforms the baseline models.
Keywords