Reliability of COVID-19 data: An evaluation and reflection.

April R Miller; Samin Charepoo; Erik Yan; Ryan W Frost; Zachary J Sturgeon; Grace Gibbon; Patrick N Balius; Cedonia S Thomas; Melanie A Schmitt; Daniel A Sass; James B Walters; Tracy L Flood; Thomas A Schmitt; COVID-19 Data Project

doi:10.1371/journal.pone.0251470

PLoS ONE (Jan 2022)

Reliability of COVID-19 data: An evaluation and reflection.

April R Miller,
Samin Charepoo,
Erik Yan,
Ryan W Frost,
Zachary J Sturgeon,
Grace Gibbon,
Patrick N Balius,
Cedonia S Thomas,
Melanie A Schmitt,
Daniel A Sass,
James B Walters,
Tracy L Flood,
Thomas A Schmitt,
COVID-19 Data Project

Affiliations

April R Miller
Samin Charepoo
Erik Yan
Ryan W Frost
Zachary J Sturgeon
Grace Gibbon
Patrick N Balius
Cedonia S Thomas
Melanie A Schmitt
Daniel A Sass
James B Walters
Tracy L Flood
Thomas A Schmitt
COVID-19 Data Project

DOI: https://doi.org/10.1371/journal.pone.0251470
Journal volume & issue: Vol. 17, no. 11
p. e0251470

Abstract

Read online

ImportanceThe rapid proliferation of COVID-19 has left governments scrambling, and several data aggregators are now assisting in the reporting of county cases and deaths. The different variables affecting reporting (e.g., time delays in reporting) necessitates a well-documented reliability study examining the data methods and discussion of possible causes of differences between aggregators.ObjectiveTo statistically evaluate the reliability of COVID-19 data across aggregators using case fatality rate (CFR) estimates and reliability statistics.Design, setting, and participantsCases and deaths were collected daily by volunteers via state and local health departments, as primary sources and newspaper reports, as secondary sources. In an effort to begin comparison for reliability statistical analysis, BroadStreet collected data from other COVID-19 aggregator sources, including USAFacts, Johns Hopkins University, New York Times, The COVID Tracking Project.Main outcomes and measuresCOVID-19 cases and death counts at the county and state levels.ResultsLower levels of inter-rater agreement were observed across aggregators associated with the number of deaths, which manifested itself in state level Bayesian estimates of COVID-19 fatality rates.Conclusions and relevanceA national, publicly available data set is needed for current and future disease outbreaks and improved reliability in reporting.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal