Identification, analysis and prediction of valid and false information related to vaccines from Romanian tweets

Andrei Valeanu; Dragos Paul Mihai; Corina Andrei; Ciprian Puscasu; Alexandra Mihaela Ionica; Miruna Ioana Hinoveanu; Valentina Patricia Predoi; Ema Bulancea; Cornel Chirita; Simona Negres; Cristian Daniel Marineci

doi:10.3389/fpubh.2024.1330801

Frontiers in Public Health (Feb 2024)

Journal volume & issue: Vol. 12

Abstract

Introduction: Online misinformation might undermine vaccination efforts. Given that no study had specifically analyzed online vaccine-related content written in Romanian, the main objective of this study was to detect and evaluate vaccine-related tweets written in Romanian.

Methods: A total of 1,400 Romanian vaccine-related tweets were manually classified as true, neutral, or fake information. They were analyzed using word cloud representations, a correlation analysis between the three classes and specific tweet characteristics, and the validation of several predictive machine learning algorithms.

Results and discussion: The tweets annotated as misinformation showed specific word patterns and were liked and reshared more often than the true and neutral ones. In the validation of the machine learning algorithms, the Support Vector Classifier yielded enhanced results in terms of Area Under the Receiver Operating Characteristic Curve score (0.744–0.843). The predictive model estimates, in a well-calibrated manner, the probability that a specific Twitter post is true, neutral, or fake. The current study offers important insights into vaccine-related online content written in an Eastern European language. Future studies must aim at building an online platform for rapid identification of vaccine misinformation and at raising awareness among the general population.
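As an illustration of the evaluation metric named in the abstract, the sketch below computes a per-class Area Under the ROC Curve for a three-class problem (true, neutral, fake) in a one-vs-rest fashion. The abstract does not say how the reported range was obtained, so the one-vs-rest scheme is an assumption, and the labels and probabilities are invented toy values, not data from the study. The helper names `binary_auc` and `one_vs_rest_auc` are hypothetical.

```python
from typing import Dict, List, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive example is scored
    higher than a random negative one (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def one_vs_rest_auc(
    y_true: Sequence[str],
    proba: Sequence[Sequence[float]],
    classes: List[str],
) -> Dict[str, float]:
    """Per-class AUC: each class in turn is treated as the positive class."""
    result = {}
    for k, cls in enumerate(classes):
        labels = [1 if y == cls else 0 for y in y_true]
        scores = [row[k] for row in proba]
        result[cls] = binary_auc(labels, scores)
    return result


# Assumed class order for the probability columns.
CLASSES = ["true", "neutral", "fake"]

# Toy annotations and predicted probabilities (invented for illustration).
y_true = ["true", "neutral", "fake", "fake", "true", "neutral"]
proba = [
    [0.7, 0.2, 0.1],
    [0.2, 0.5, 0.3],
    [0.1, 0.2, 0.7],
    [0.3, 0.3, 0.4],
    [0.4, 0.4, 0.2],
    [0.5, 0.3, 0.2],
]

aucs = one_vs_rest_auc(y_true, proba, CLASSES)
for cls in CLASSES:
    print(f"{cls}: {aucs[cls]:.4f}")
```

A per-class breakdown of this kind would be one plausible reason for a metric to be reported as a range rather than a single value. In practice a library routine such as scikit-learn's `roc_auc_score` would normally replace the hand-written pairwise loop.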

Published in Frontiers in Public Health

ISSN: 2296-2565 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Public aspects of medicine
Website: https://www.frontiersin.org/journals/public-health
