Comparison of methods for predicting COVID-19-related death in the general population using the OpenSAFELY platform

The OpenSAFELY Collaborative; Elizabeth J. Williamson; John Tazare; Krishnan Bhaskaran; Helen I. McDonald; Alex J. Walker; Laurie Tomlinson; Kevin Wing; Sebastian Bacon; Chris Bates; Helen J. Curtis; Harriet J. Forbes; Caroline Minassian; Caroline E. Morton; Emily Nightingale; Amir Mehrkar; David Evans; Brian D. Nicholson; David A. Leon; Peter Inglesby; Brian MacKenna; Nicholas G. Davies; Nicholas J. DeVito; Henry Drysdale; Jonathan Cockburn; William J. Hulme; Jessica Morley; Ian Douglas; Christopher T. Rentsch; Rohini Mathur; Angel Wong; Anna Schultze; Richard Croker; John Parry; Frank Hester; Sam Harper; Richard Grieve; David A. Harrison; Ewout W. Steyerberg; Rosalind M. Eggo; Karla Diaz-Ordaz; Ruth Keogh; Stephen J. W. Evans; Liam Smeeth; Ben Goldacre

doi:10.1186/s41512-022-00120-2

Diagnostic and Prognostic Research (Feb 2022)

Affiliations

The OpenSAFELY Collaborative
Elizabeth J. Williamson: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
John Tazare: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Krishnan Bhaskaran: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Helen I. McDonald: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Alex J. Walker: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Laurie Tomlinson: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Kevin Wing: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Sebastian Bacon: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Chris Bates: TPP
Helen J. Curtis: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Harriet J. Forbes: University of Bristol
Caroline Minassian: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Caroline E. Morton: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Emily Nightingale: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Amir Mehrkar: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
David Evans: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Brian D. Nicholson: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
David A. Leon: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Peter Inglesby: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Brian MacKenna: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Nicholas G. Davies: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Nicholas J. DeVito: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Henry Drysdale: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Jonathan Cockburn: TPP
William J. Hulme: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Jessica Morley: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
Ian Douglas: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Christopher T. Rentsch: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Rohini Mathur: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Angel Wong: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Anna Schultze: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Richard Croker: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford
John Parry: TPP
Frank Hester: TPP
Sam Harper: TPP
Richard Grieve: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
David A. Harrison: Intensive Care National Audit & Research Centre (ICNARC)
Ewout W. Steyerberg: Leiden University Medical Center
Rosalind M. Eggo: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Karla Diaz-Ordaz: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Ruth Keogh: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Stephen J. W. Evans: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Liam Smeeth: London School of Hygiene and Tropical Medicine, Faculty of Epidemiology & Population Health
Ben Goldacre: The DataLab, Nuffield Department of Primary Care Health Sciences, University of Oxford

DOI: https://doi.org/10.1186/s41512-022-00120-2
Journal volume & issue: Vol. 6, no. 1, pp. 1–15

Abstract

Background: Obtaining accurate estimates of the risk of COVID-19-related death in the general population is challenging in the context of changing levels of circulating infection.

Methods: We propose a modelling approach to predict 28-day COVID-19-related death which explicitly accounts for COVID-19 infection prevalence using a series of sub-studies from new landmark times incorporating time-updating proxy measures of COVID-19 infection prevalence. This was compared with an approach ignoring infection prevalence. The target population was adults registered at a general practice in England in March 2020. The outcome was 28-day COVID-19-related death. Predictors included demographic characteristics and comorbidities. Three proxies of local infection prevalence were used: model-based estimates, rate of COVID-19-related attendances in emergency care, and rate of suspected COVID-19 cases in primary care. We used data within the TPP SystmOne electronic health record system linked to Office for National Statistics mortality data, using the OpenSAFELY platform, working on behalf of NHS England. Prediction models were developed in case-cohort samples with a 100-day follow-up. Validation was undertaken in 28-day cohorts from the target population. We considered predictive performance (discrimination and calibration) in geographical and temporal subsets of data not used in developing the risk prediction models. Simple models were contrasted to models including a full range of predictors.

Results: Prediction models were developed on 11,972,947 individuals, of whom 7999 experienced COVID-19-related death. All models discriminated well between individuals who did and did not experience the outcome, including simple models adjusting only for basic demographics and number of comorbidities: C-statistics 0.92–0.94. However, absolute risk estimates were substantially miscalibrated when infection prevalence was not explicitly modelled.

Conclusions: Our proposed models allow absolute risk estimation in the context of changing infection prevalence but predictive performance is sensitive to the proxy for infection prevalence. Simple models can provide excellent discrimination and may simplify implementation of risk prediction tools.
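
The contrast the abstract draws between discrimination and calibration can be illustrated with a small simulation. The sketch below is not the authors' model or data. It assumes a hypothetical binary 28-day outcome, a single synthetic risk score standing in for demographics and comorbidities, and a hypothetical five-fold increase in the odds of death caused by higher infection prevalence. All function names and parameter values are illustrative assumptions. A "naive" prediction ignores the prevalence shift, while an "aware" prediction includes it. Both rank individuals identically, so their C-statistics match, but only the aware prediction has an observed-to-expected ratio near 1.

```python
import math
import random


def expit(x):
    """Inverse logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def c_statistic(risks, outcomes):
    """C-statistic for a binary outcome via the rank-sum (Mann-Whitney) formula.

    Tied predicted risks receive the average of their ranks.
    """
    order = sorted(range(len(risks)), key=lambda i: risks[i])
    ranks = [0.0] * len(risks)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and risks[order[j + 1]] == risks[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_events = sum(outcomes)
    n_non_events = len(outcomes) - n_events
    rank_sum = sum(r for r, y in zip(ranks, outcomes) if y == 1)
    u = rank_sum - n_events * (n_events + 1) / 2
    return u / (n_events * n_non_events)


def observed_expected_ratio(risks, outcomes):
    """Calibration-in-the-large: observed events divided by expected events."""
    return sum(outcomes) / sum(risks)


def simulate(n=20000, intercept=-6.0, prevalence_odds_ratio=5.0, seed=1):
    """Compare predictions that ignore or include a prevalence shift.

    All parameter values are hypothetical and chosen only for illustration.
    """
    rng = random.Random(seed)
    # Synthetic individual-level risk score (stand-in for age, comorbidities).
    score = [rng.gauss(0.0, 1.5) for _ in range(n)]
    shift = math.log(prevalence_odds_ratio)
    naive = [expit(intercept + s) for s in score]          # ignores prevalence
    aware = [expit(intercept + shift + s) for s in score]  # includes prevalence
    # Outcomes are generated under the higher-prevalence (true) risks.
    outcomes = [1 if rng.random() < p else 0 for p in aware]
    return {
        "c_naive": c_statistic(naive, outcomes),
        "c_aware": c_statistic(aware, outcomes),
        "oe_naive": observed_expected_ratio(naive, outcomes),
        "oe_aware": observed_expected_ratio(aware, outcomes),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

Because the prevalence shift is a constant added on the log-odds scale, it changes every predicted risk without changing their order. This is why, in this toy setting, a model can discriminate well yet give absolute risks that are far too low once infection prevalence rises.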

Published in Diagnostic and Prognostic Research

ISSN: 2397-7523 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: https://diagnprognres.biomedcentral.com/
