Intra-database validation of case-identifying algorithms using reconstituted electronic health records from healthcare claims data

Nicolas H. Thurin; Pauline Bosco-Levy; Patrick Blin; Magali Rouyer; Jérémy Jové; Stéphanie Lamarque; Séverine Lignot; Régis Lassalle; Abdelilah Abouelfath; Emmanuelle Bignon; Pauline Diez; Marine Gross-Goupil; Michel Soulié; Mathieu Roumiguié; Sylvestre Le Moulec; Marc Debouverie; Bruno Brochet; Francis Guillemin; Céline Louapre; Elisabeth Maillart; Olivier Heinzlef; Nicholas Moore; Cécile Droz-Perroteau

doi:10.1186/s12874-021-01285-y

BMC Medical Research Methodology (May 2021)

Affiliations

Nicolas H. Thurin: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Pauline Bosco-Levy: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Patrick Blin: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Magali Rouyer: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Jérémy Jové: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Stéphanie Lamarque: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Séverine Lignot: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Régis Lassalle: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Abdelilah Abouelfath: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Emmanuelle Bignon: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Pauline Diez: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Marine Gross-Goupil: Department of Medical Oncology, Hôpital Saint André, CHU de Bordeaux
Michel Soulié: Department of Urology, University Hospital of Rangueil, CHU de Toulouse
Mathieu Roumiguié: Department of Urology, University Hospital of Rangueil, CHU de Toulouse
Sylvestre Le Moulec: Department of Oncology, Clinique Marzet
Marc Debouverie: Department of Neurology, CHRU de Nancy
Bruno Brochet: CRC SEP, Neurology Department, CHU de Bordeaux
Francis Guillemin: Université de Lorraine, EA 4360 APEMAC
Céline Louapre: Sorbonne Université, Institut du cerveau, ICM, Hôpital de la Pitié Salpêtrière, INSERM UMR S 1127, CNRS UMR 7225
Elisabeth Maillart: Neurology Department, Hôpital de la Pitié Salpêtrière, APHP
Olivier Heinzlef: Department of Neurology, Hôpital CHI de Poissy/Saint-Germain-en-Laye
Nicholas Moore: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux
Cécile Droz-Perroteau: INSERM CIC-P1401, Bordeaux PharmacoEpi, Univ. Bordeaux

Journal volume & issue: Vol. 21, no. 1, pp. 1-8

Abstract

Background: The diagnostic performance of case-identifying algorithms developed in healthcare databases is usually assessed by comparing identified cases with an external data source. When this is not feasible, intra-database validation can be an appropriate alternative.

Objectives: To illustrate, through two practical examples, how to perform intra-database validation of case-identifying algorithms using reconstituted Electronic Health Records (rEHRs).

Methods: Patients with 1) multiple sclerosis (MS) relapses and 2) metastatic castration-resistant prostate cancer (mCRPC) were identified in the French nationwide healthcare database (SNDS) using two case-identifying algorithms. A validation study was then conducted to estimate the diagnostic performance of these algorithms by calculating their positive predictive value (PPV) and negative predictive value (NPV). To that end, anonymized rEHRs were generated from all the information captured in the SNDS over time (e.g. procedures, hospital stays, drug dispensing, medical visits) for a random selection of patients identified as cases or non-cases by the predefined algorithms. For each disease, an independent validation committee reviewed the rEHRs of 100 cases and 100 non-cases to adjudicate the status of the selected patients (true case/true non-case), blinded to the result of the corresponding algorithm.

Results: The algorithm identifying relapses in MS showed a PPV of 95% and an NPV of 100%. The algorithm identifying mCRPC showed a PPV of 97% and an NPV of 99%.

Conclusion: In the absence of an alternative, using rEHRs to conduct an intra-database validation appears to be a valuable way to estimate the performance of a case-identifying algorithm and assess its validity.
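As an illustration of how the reported predictive values relate to the adjudicated samples, the following Python sketch computes PPV and NPV from review counts. It is a minimal sketch and not the authors' code: the counts (for example 95 confirmed cases out of 100 reviewed) are inferred from the reported percentages under the assumption that all 100 records in each group were adjudicated, and the names `Adjudication`, `ppv`, `npv` and `wilson_interval` are hypothetical. The Wilson score interval is added here only to show the precision a sample of 100 allows; the abstract does not say which interval method, if any, was used.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass(frozen=True)
class Adjudication:
    """Committee review counts for one algorithm (hypothetical structure)."""
    confirmed_cases: int       # algorithm-positive records adjudicated as true cases
    reviewed_cases: int        # algorithm-positive records reviewed
    confirmed_non_cases: int   # algorithm-negative records adjudicated as true non-cases
    reviewed_non_cases: int    # algorithm-negative records reviewed


def ppv(a: Adjudication) -> float:
    """Positive predictive value: share of algorithm-positive records that are true cases."""
    return a.confirmed_cases / a.reviewed_cases


def npv(a: Adjudication) -> float:
    """Negative predictive value: share of algorithm-negative records that are true non-cases."""
    return a.confirmed_non_cases / a.reviewed_non_cases


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 gives about 95% coverage)."""
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


# Counts inferred from the reported percentages (assumption: 100 adjudicated per group).
ms_relapse = Adjudication(95, 100, 100, 100)
mcrpc = Adjudication(97, 100, 99, 100)

for name, a in [("MS relapse", ms_relapse), ("mCRPC", mcrpc)]:
    lo, hi = wilson_interval(a.confirmed_cases, a.reviewed_cases)
    print(f"{name}: PPV={ppv(a):.2f} (Wilson {lo:.3f} to {hi:.3f}), NPV={npv(a):.2f}")
```

Because cases and non-cases are sampled separately in fixed numbers, PPV and NPV can be read directly from each group, whereas sensitivity and specificity cannot be derived from these counts alone.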

Published in BMC Medical Research Methodology

ISSN: 1471-2288 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: http://bmcmedresmethodol.biomedcentral.com
