Computable phenotype for real-world, data-driven retrospective identification of relapse in ANCA-associated vasculitis
Vladimir Tesar,
Conor Judge,
John Kelleher,
Zdenka Hrušková,
Raashid Ahmed Luqmani,
Jennifer Scott,
Peter A Merkel,
Mark A Little,
Niall Conlon,
Louis Aslett,
Arthur White,
Julie Power,
Matthew A Rutherford,
James Ng,
Kuruvilla Sebastian,
Sorcha O’Brien,
Sarah M Moran
Affiliations
Vladimir Tesar
st Faculty of Medicine, Charles University, Prague, Czech Republic
Conor Judge
School of Medicine, College of Medicine, Nursing and Health Science, University of Galway, Galway, Ireland
John Kelleher
Department of Statistics, Dublin Institute of Technology, Dublin, Ireland
Zdenka Hrušková
st Faculty of Medicine, Charles University, Prague, Czech Republic
Raashid Ahmed Luqmani
Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Science (NDORMs), University of Oxford, Oxford, UK
Jennifer Scott
Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
Peter A Merkel
4 Division of Rheumatology, Department of Medicine, and Division of Epidemiology, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
Mark A Little
Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
Niall Conlon
Department of Immunology, St James`s Hospital, Dublin, Ireland
Louis Aslett
Department of Mathematical Science, University of Durham, Durham, UK
Arthur White
School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
Julie Power
Vasculitis Ireland Awareness, Dublin, Ireland
Matthew A Rutherford
School of Infection & Immunity, University of Glasgow, Glasgow, UK
James Ng
School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
Kuruvilla Sebastian
Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
Sorcha O’Brien
Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
Sarah M Moran
Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
Objective ANCA-associated vasculitis (AAV) is a relapsing-remitting disease, resulting in incremental tissue injury. The gold-standard relapse definition (Birmingham Vasculitis Activity Score, BVAS>0) is often missing or inaccurate in registry settings, leading to errors in ascertainment of this key outcome. We sought to create a computable phenotype (CP) to automate retrospective identification of relapse using real-world data in the research setting.Methods We studied 536 patients with AAV and >6 months follow-up recruited to the Rare Kidney Disease registry (a national longitudinal, multicentre cohort study). We followed five steps: (1) independent encounter adjudication using primary medical records to assign the ground truth, (2) selection of data elements (DEs), (3) CP development using multilevel regression modelling, (4) internal validation and (5) development of additional models to handle missingness. Cut-points were determined by maximising the F1-score. We developed a web application for CP implementation, which outputs an individualised probability of relapse.Results Development and validation datasets comprised 1209 and 377 encounters, respectively. After classifying encounters with diagnostic histopathology as relapse, we identified five key DEs; DE1: change in ANCA level, DE2: suggestive blood/urine tests, DE3: suggestive imaging, DE4: immunosuppression status, DE5: immunosuppression change. F1-score, sensitivity and specificity were 0.85 (95% CI 0.77 to 0.92), 0.89 (95% CI 0.80 to 0.99) and 0.96 (95% CI 0.93 to 0.99), respectively. Where DE5 was missing, DE2 plus either DE1/DE3 were required to match the accuracy of BVAS.Conclusions This CP accurately quantifies the individualised probability of relapse in AAV retrospectively, using objective, readily accessible registry data. This framework could be leveraged for other outcomes and relapsing diseases.