Influenza-like illness surveillance on Twitter through automated learning of naïve language.

Francesco Gesualdo; Giovanni Stilo; Eleonora Agricola; Michaela V Gonfiantini; Elisabetta Pandolfi; Paola Velardi; Alberto E Tozzi

doi:10.1371/journal.pone.0082489

PLoS ONE (Jan 2013)

Influenza-like illness surveillance on Twitter through automated learning of naïve language.

Francesco Gesualdo,
Giovanni Stilo,
Eleonora Agricola,
Michaela V Gonfiantini,
Elisabetta Pandolfi,
Paola Velardi,
Alberto E Tozzi

Affiliations

Francesco Gesualdo
Giovanni Stilo
Eleonora Agricola
Michaela V Gonfiantini
Elisabetta Pandolfi
Paola Velardi
Alberto E Tozzi

DOI: https://doi.org/10.1371/journal.pone.0082489
Journal volume & issue: Vol. 8, no. 12
p. e82489

Abstract

Read online

Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal