Clinical evaluation of code‐based algorithms to identify patients with pulmonary arterial hypertension in healthcare databases

Eva‐Maria Didden; Di Lu; Andrew Hsi; Monika Brand; Haley Hedlin; Roham T. Zamanian

doi:10.1002/pul2.12333

Pulmonary Circulation (Jan 2024)

Affiliations

Eva‐Maria Didden: Global Epidemiology, Rare Disease Epicenter, Actelion Pharmaceuticals Ltd, Janssen Pharmaceutical Company of Johnson & Johnson, Allschwil, Switzerland
Di Lu: Quantitative Sciences Unit, Stanford University, Stanford, California, USA
Andrew Hsi: Adult PH Program, Vera Moulton Wall Center, Stanford University, Stanford, California, USA
Monika Brand: Global Epidemiology, Rare Disease Epicenter, Actelion Pharmaceuticals Ltd, Janssen Pharmaceutical Company of Johnson & Johnson, Allschwil, Switzerland
Haley Hedlin: Quantitative Sciences Unit, Stanford University, Stanford, California, USA
Roham T. Zamanian: Adult PH Program, Vera Moulton Wall Center, Stanford University, Stanford, California, USA

Journal volume & issue: Vol. 14, no. 1

Abstract

Pulmonary arterial hypertension (PAH) is a rare subgroup of pulmonary hypertension (PH). Claims and administrative databases can be particularly important for research in rare diseases; however, there is a lack of validated algorithms to identify PAH patients using administrative codes. We aimed to measure the accuracy of code‐based PAH algorithms against the true clinical diagnosis by right heart catheterization (RHC). This study evaluated algorithms in patients who were recorded in two linkable data assets: the Stanford Healthcare administrative electronic health record database and the Stanford Vera Moulton Wall Center clinical PH database (which records each patient's RHC diagnosis). We assessed the sensitivity and specificity achieved by 16 algorithms (six published). In total, 720 PH patients with linked data available were included and 558 (78%) of these were PAH patients. Algorithms consisting solely of a P(A)H‐specific diagnostic code classed all or almost all PH patients as PAH (sensitivity >97%, specificity <12%), while multicomponent algorithms with well‐defined temporal sequences of procedure, diagnosis, and treatment codes achieved a better balance of sensitivity and specificity. Specificity increased and sensitivity decreased with increasing algorithm complexity. The best‐performing algorithms, in terms of fewest misclassified patients, included multiple components (e.g., PH diagnosis, PAH treatment, continuous enrollment for ≥6 months before and ≥12 months following index date) and achieved sensitivities and specificities of around 95% and 38%, respectively. Our findings help researchers tailor their choice and design of code‐based PAH algorithms to their research question and demonstrate the importance of including well‐defined temporal components in the algorithms.
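To make the two accuracy measures concrete, the following is a minimal sketch (not the authors' code) of how sensitivity and specificity could be computed when a code‐based algorithm's output is compared with the RHC diagnosis. The names `ConfusionCounts`, `tally`, `sensitivity`, `specificity`, and `misclassified` are hypothetical. The only figures taken from the abstract are the cohort sizes (720 PH patients, 558 with PAH); the algorithm shown is a deliberately extreme, assumed case that flags every PH patient as PAH.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts from comparing algorithm flags with the RHC reference diagnosis."""
    true_pos: int   # flagged as PAH, PAH by RHC
    false_neg: int  # not flagged, PAH by RHC
    true_neg: int   # not flagged, not PAH by RHC
    false_pos: int  # flagged as PAH, not PAH by RHC


def tally(flagged: Sequence[bool], rhc_pah: Sequence[bool]) -> ConfusionCounts:
    """Cross-tabulate algorithm output against the reference standard."""
    if len(flagged) != len(rhc_pah):
        raise ValueError("inputs must have one entry per patient")
    tp = sum(f and r for f, r in zip(flagged, rhc_pah))
    fn = sum((not f) and r for f, r in zip(flagged, rhc_pah))
    tn = sum((not f) and (not r) for f, r in zip(flagged, rhc_pah))
    fp = sum(f and (not r) for f, r in zip(flagged, rhc_pah))
    return ConfusionCounts(tp, fn, tn, fp)


def sensitivity(c: ConfusionCounts) -> float:
    """Share of RHC-confirmed PAH patients that the algorithm flags."""
    return c.true_pos / (c.true_pos + c.false_neg)


def specificity(c: ConfusionCounts) -> float:
    """Share of non-PAH PH patients that the algorithm correctly leaves unflagged."""
    return c.true_neg / (c.true_neg + c.false_pos)


def misclassified(c: ConfusionCounts) -> int:
    """Total number of patients the algorithm assigns to the wrong group."""
    return c.false_neg + c.false_pos


# Cohort sizes from the abstract: 720 PH patients, 558 of them PAH by RHC.
rhc_pah = [True] * 558 + [False] * (720 - 558)

# Assumed extreme case: an algorithm that flags every PH patient as PAH.
flag_everyone = [True] * 720

counts = tally(flag_everyone, rhc_pah)
print(sensitivity(counts), specificity(counts), misclassified(counts))
```

In this assumed extreme case every PAH patient is captured, but all 162 non‐PAH patients are misclassified, which is the direction of the trade‐off the abstract reports for single‐code algorithms.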

Published in Pulmonary Circulation

ISSN: 2045-8940 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Diseases of the circulatory (Cardiovascular) system; Medicine: Internal medicine: Specialties of internal medicine: Diseases of the respiratory system
Website: https://onlinelibrary.wiley.com/journal/20458940
