Misinformation Features Detection in Weibo: Unsupervised Learning, Latent Dirichlet Allocation, and Network Structure

Yiping Li; Xuanfeng Li; Yuejing Zhai; Di Wang; Chitin Hon

doi:10.1109/ACCESS.2024.3494015

IEEE Access (Jan 2024)

Misinformation Features Detection in Weibo: Unsupervised Learning, Latent Dirichlet Allocation, and Network Structure

Yiping Li,
Xuanfeng Li,
Yuejing Zhai,
Di Wang,
Chitin Hon

Affiliations

Yiping Li: ORCiD; Department of Humanities and Arts, Macau University of Science and Technology, Macau, China
Xuanfeng Li: ORCiD; Institute of Systems Engineering, Macau University of Science and Technology, Macau, China
Yuejing Zhai: Macao Polytechnic University, Macau, China
Di Wang: ORCiD; Department of Humanities and Arts, Macau University of Science and Technology, Macau, China
Chitin Hon: ORCiD; Institute of Systems Engineering, Macau University of Science and Technology, Macau, China

DOI: https://doi.org/10.1109/ACCESS.2024.3494015
Journal volume & issue: Vol. 12
pp. 166977 – 166987

Abstract

Read online

This study employs an unsupervised learning technique to identify features of misinformation on Weibo, a popular social media platform in China. By utilizing BERT (Bidirectional Encoder Representations from Transformers) for sequence classification, we were able to detect a dataset with a mix of true and false information, achieving a perfect recall rate for false information detection. However, the precision for true information was low, with a 3% precision, 6% recall, and an F1 score of 6%, indicating a high rate of misclassification. To address this, we conducted a Latent Dirichlet Allocation (LDA) analysis on the misclassified true information, identifying specific features that led to incorrect classification. Additionally, social network analysis revealed the presence of structural holes within the information network. This study contributes to the understanding of misinformation detection mechanisms and provides insights into the social dynamics of information spread.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords