Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record

Rashmee U. Shah; Rebeka Mukherjee; Yue Zhang; Aubrey E. Jones; Jennifer Springer; Ian Hackett; Benjamin A. Steinberg; Donald M. Lloyd‐Jones; Wendy W. Chapman

doi:10.1161/JAHA.119.014527

Journal of the American Heart Association: Cardiovascular and Cerebrovascular Disease (Mar 2020)

Impact of Different Electronic Cohort Definitions to Identify Patients With Atrial Fibrillation From the Electronic Medical Record

Rashmee U. Shah,
Rebeka Mukherjee,
Yue Zhang,
Aubrey E. Jones,
Jennifer Springer,
Ian Hackett,
Benjamin A. Steinberg,
Donald M. Lloyd‐Jones,
Wendy W. Chapman

Affiliations

Rashmee U. Shah: Division of Cardiovascular Medicine Department of Internal Medicine University of Utah School of Medicine Salt Lake City UT
Rebeka Mukherjee: Division of Cardiovascular Medicine Department of Internal Medicine University of Utah School of Medicine Salt Lake City UT
Yue Zhang: Division of Epidemiology Department of Internal Medicine University of Utah School of Medicine Salt Lake City UT
Aubrey E. Jones: Department of Population Health University of Utah School of Medicine Salt Lake City UT
Jennifer Springer: Division of Cardiovascular Medicine Department of Internal Medicine University of Utah School of Medicine Salt Lake City UT
Ian Hackett: Section of Cardiology Department of Internal Medicine University of Chicago Pritzker School of Medicine Chicago IL
Benjamin A. Steinberg: Division of Cardiovascular Medicine Department of Internal Medicine University of Utah School of Medicine Salt Lake City UT
Donald M. Lloyd‐Jones: Department of Preventive Medicine Northwestern University Feinberg School of Medicine Chicago IL
Wendy W. Chapman: Centre for Digital Transformation of Health Victorian Comprehensive Cancer Centre Melbourne Australia

DOI: https://doi.org/10.1161/JAHA.119.014527
Journal volume & issue: Vol. 9, no. 5

Abstract

Read online

Background Electronic medical records (EMRs) allow identification of disease‐specific patient populations, but varying electronic cohort definitions could result in different populations. We compared the characteristics of an electronic medical record–derived atrial fibrillation (AF) patient population using 5 different electronic cohort definitions. Methods and Results Adult patients with at least 1 AF billing code from January 1, 2010, to December 31, 2017, were included. Based on different electronic cohort definitions, we trained 5 different logistic regression models using a labeled training data set (n=786). Each model yielded a predicted probability; patients were classified as having AF if the probability was higher than a specified cut point. Test characteristics were calculated for each model. These models were then applied to the full cohort and resulting characteristics were compared. In the training set, the comprehensive model (including demographics, billing codes, and natural language processing results) performed best, with an area under the curve of 0.89, sensitivity of 0.90, and specificity of 0.87. Among a candidate population (n=22 000), the proportion of patients identified as having AF varied from 61% in the model using diagnosis or procedure International Classification of Diseases (ICD) billing codes to 83% in the model using natural language processing of clinical notes. Among identified AF patients, the proportion of patients with a CHA2DS2‐VASc score ≥2 varied from 69% to 85%; oral anticoagulant treatment rates varied from 50% to 66% depending on the model. Conclusions Different electronic cohort definitions result in substantially different AF study samples. This difference threatens the quality and reproducibility of electronic medical record–based research and quality initiatives.

Published in Journal of the American Heart Association: Cardiovascular and Cerebrovascular Disease

ISSN: 2047-9980 (Online)
Publisher: Wiley
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Diseases of the circulatory (Cardiovascular) system
Website: https://www.ahajournals.org/journal/jaha

About the journal

Abstract

Keywords