International Journal of Computational Intelligence Systems (Jan 2024)

A LDA-Based Social Media Data Mining Framework for Plastic Circular Economy

  • Yangyimin Xue,
  • Chandrasekhar Kambhampati,
  • Yongqiang Cheng,
  • Nishikant Mishra,
  • Nur Wulandhari,
  • Pauline Deutz

DOI
https://doi.org/10.1007/s44196-023-00375-7
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 14

Abstract

Read online

Abstract The mass production of plastic waste has caused an urgent worldwide public health crisis. Although government policies and industrial innovation are the driving forces to meet this challenge, trying to understand public attitudes may improve the efficiency of this process. Social media has become the main ways for the public to obtain information and express opinions and feelings. This motivated us to mine the perceptions and behavioral responses towards plastic usage using social media data. In this paper, we proposed a framework for data collection and analysis based on mainstream media in the UK to obtain public opinions on plastics. An unsupervised machine learning model based on Latent Dirichlet Allocation (LDA) has been employed to analyse and cluster the topics to deal with the lack of annotation of the data contents. An additional dictionary method was then proposed to evaluate the sentiment of the comments. The framework also provides tools to visualise the model and results to stimulate insightful understandings. We validated the framework's effectiveness by applying it to analyse three mainstream social media, where 6 first-level topic categories and 13 second-level topic categories from the comment texts related to plastics have been identified. The results show that public sentiment towards plastic products is generally stable. The spatiotemporal distribution of each topic's sentiment is highly correlated with the number of occurrences.

Keywords