Journal of Clinical and Translational Science (Apr 2023)

331 Reusing EHR Phenotyping Algorithms in Practice: Developing the Colorado Diabetes EHR Research Repository (CODER)

  • Melissa P. Wilson,
  • David A. Mayer,
  • Luke V. Rasmussen,
  • Pramod Khanal,
  • Maryam Nuriyeva,
  • Michael McRae,
  • Sridharan Raghavan,
  • Laura K. Wiley

DOI
https://doi.org/10.1017/cts.2023.378
Journal volume & issue
Vol. 7
pp. 98 – 99

Abstract

Read online

OBJECTIVES/GOALS: The rates of computational phenotyping algorithm reuse across health systems are low, leading to a proliferation of algorithms for the same trait. We propose a framework for reusing computational phenotyping algorithms and describe the real-world deployment of this framework for the development of the Colorado Diabetes EHR Research Repository. METHODS/STUDY POPULATION: The novel phenotype reuse framework consists of 4 steps: select algorithms that are appropriate for reuse by assessing whether they are fit for purpose; extend the algorithm to account for changes in data and care practice standards; localize the algorithm to use local database standards and terminologies; optimize the algorithm by applying a data driven approach to achieve the desired local performance. To identify individuals with type 1 diabetes (T1D) or type 2 diabetes (T2D), we selected and implemented T2D algorithms in a cohort of adults with any diabetes or pre-diabetes related diagnosis code, medication, or abnormal glucose-related laboratory test in the clinical data warehouse for UCHealth and the University of Colorado. RESULTS/ANTICIPATED RESULTS: We included a total of 926,290 patients who were identified by initial filters. Patients were more likely to be female (53%), identify as non-Hispanic white (69%) and had a median age of 58 years (IQR: 41, 70). Implementation, extension, localization, & optimization through iterative chart review prioritized high sensitivity for all-cause diabetes and high specificity for T1D and T2D. Of the original cohort, 252,946 (27%) were identified by the all-cause diabetes algorithm. Of these 11,688 were identified as T1D and 135,588 as T2D. After optimization the all-cause diabetes algorithm had 88% sensitivity, 90% specificity, 74% positive predictive value (PPV), and 96% negative predictive value (NPV). Our algorithms for T1D and T2D had high specificity (100% and 99%, respectively) and PPV (100 and 96% respectively). DISCUSSION/SIGNIFICANCE: Developing computational phenotyping algorithms is expensive and time consuming, yet algorithm reuse is low due to a lack of practical approaches for reusing algorithms. We demonstrate application of a novel framework for algorithm reuse, yielding good alignment of algorithm performance with study goals for identifying individuals with diabetes.