‘Caveat emptor’: the cautionary tale of endocarditis and the potential pitfalls of clinical coding data—an electronic health records study

Nicola Fawcett; Bernadette Young; Leon Peto; T. Phuong Quan; Richard Gillott; Jianhua Wu; Chris Middlemass; Sheila Weston; Derrick W. Crook; Tim E. A. Peto; Berit Muller-Pebody; Alan P. Johnson; A. Sarah Walker; Jonathan A. T. Sandoe

doi:10.1186/s12916-019-1390-x

BMC Medicine (Sep 2019)

‘Caveat emptor’: the cautionary tale of endocarditis and the potential pitfalls of clinical coding data—an electronic health records study

Nicola Fawcett,
Bernadette Young,
Leon Peto,
T. Phuong Quan,
Richard Gillott,
Jianhua Wu,
Chris Middlemass,
Sheila Weston,
Derrick W. Crook,
Tim E. A. Peto,
Berit Muller-Pebody,
Alan P. Johnson,
A. Sarah Walker,
Jonathan A. T. Sandoe

Affiliations

Nicola Fawcett: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
Bernadette Young: Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital
Leon Peto: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
T. Phuong Quan: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
Richard Gillott: Department of Cardiology, Leeds Teaching Hospitals NHS Trust and University of Leeds
Jianhua Wu: School of Dentistry, University of Leeds
Chris Middlemass: Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital
Sheila Weston: Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital
Derrick W. Crook: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
Tim E. A. Peto: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
Berit Muller-Pebody: National Infection Service, Public Health England
Alan P. Johnson: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
A. Sarah Walker: National Institute for Health Research (NIHR) Health Protection Research Unit on Healthcare Associated Infections and Antimicrobial Resistance, John Radcliffe Hospital
Jonathan A. T. Sandoe: Department of Microbiology, Leeds Teaching Hospitals NHS Trust and University of Leeds

DOI: https://doi.org/10.1186/s12916-019-1390-x
Journal volume & issue: Vol. 17, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background Diagnostic codes from electronic health records are widely used to assess patterns of disease. Infective endocarditis is an uncommon but serious infection, with objective diagnostic criteria. Electronic health records have been used to explore the impact of changing guidance on antibiotic prophylaxis for dental procedures on incidence, but limited data on the accuracy of the diagnostic codes exists. Endocarditis was used as a clinically relevant case study to investigate the relationship between clinical cases and diagnostic codes, to understand discrepancies and to improve design of future studies. Methods Electronic health record data from two UK tertiary care centres were linked with data from a prospectively collected clinical endocarditis service database (Leeds Teaching Hospital) or retrospective clinical audit and microbiology laboratory blood culture results (Oxford University Hospitals Trust). The relationship between diagnostic codes for endocarditis and confirmed clinical cases according to the objective Duke criteria was assessed, and impact on estimations of disease incidence and trends. Results In Leeds 2006–2016, 738/1681(44%) admissions containing any endocarditis code represented a definite/possible case, whilst 263/1001(24%) definite/possible endocarditis cases had no endocarditis code assigned. In Oxford 2010–2016, 307/552(56%) reviewed endocarditis-coded admissions represented a clinical case. Diagnostic codes used by most endocarditis studies had good positive predictive value (PPV) but low sensitivity (e.g. I33-primary 82% and 43% respectively); one (I38-secondary) had PPV under 6%. Estimating endocarditis incidence using raw admission data overestimated incidence trends twofold. Removing records with non-specific codes, very short stays and readmissions improved predictive ability. Estimating incidence of streptococcal endocarditis using secondary codes also overestimated increases in incidence over time. Reasons for discrepancies included changes in coding behaviour over time, and coding guidance allowing assignment of a code mentioning ‘endocarditis’ where endocarditis was never mentioned in the clinical notes. Conclusions Commonly used diagnostic codes in studies of endocarditis had good predictive ability. Other apparently plausible codes were poorly predictive. Use of diagnostic codes without examining sensitivity and predictive ability can give inaccurate estimations of incidence and trends. Similar considerations may apply to other diseases. Health record studies require validation of diagnostic codes and careful data curation to minimise risk of serious errors.

Published in BMC Medicine

ISSN: 1741-7015 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: http://bmcmedicine.biomedcentral.com

About the journal

Abstract

Keywords