Can Fake News Detection Models Maintain the Performance through Time? A Longitudinal Evaluation of Twitter Publications

Nuno Guimarães; Álvaro Figueira; Luís Torgo

doi:10.3390/math9222988

Mathematics (Nov 2021)

Can Fake News Detection Models Maintain the Performance through Time? A Longitudinal Evaluation of Twitter Publications

Nuno Guimarães,
Álvaro Figueira,
Luís Torgo

Affiliations

Nuno Guimarães: CRACS-INESCTEC, University of Porto, 4169-007 Porto, Portugal
Álvaro Figueira: CRACS-INESCTEC, University of Porto, 4169-007 Porto, Portugal
Luís Torgo: Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 1W5, Canada

DOI: https://doi.org/10.3390/math9222988
Journal volume & issue: Vol. 9, no. 22
p. 2988

Abstract

Read online

The negative impact of false information on social networks is rapidly growing. Current research on the topic focused on the detection of fake news in a particular context or event (such as elections) or using data from a short period of time. Therefore, an evaluation of the current proposals in a long-term scenario where the topics discussed may change is lacking. In this work, we deviate from current approaches to the problem and instead focus on a longitudinal evaluation using social network publications spanning an 18-month period. We evaluate different combinations of features and supervised models in a long-term scenario where the training and testing data are ordered chronologically, and thus the robustness and stability of the models can be evaluated through time. We experimented with 3 different scenarios where the models are trained with 15-, 30-, and 60-day data periods. The results show that detection models trained with word-embedding features are the ones that perform better and are less likely to be affected by the change of topics (for example, the rise of COVID-19 conspiracy theories). Furthermore, the additional days of training data also increase the performance of the best feature/model combinations, although not very significantly (around 2%). The results presented in this paper build the foundations towards a more pragmatic approach to the evaluation of fake news detection models in social networks.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords