BMC Health Services Research (Jun 2022)

Use of algorithms for identifying patients in a German claims database: learnings from a lung cancer case

  • Sina Neugebauer,
  • Frank Griesinger,
  • Sabine Dippel,
  • Stephanie Heidenreich,
  • Nina Gruber,
  • Detlef Chruscz,
  • Sebastian Lempfert,
  • Peter Kaskel

DOI
https://doi.org/10.1186/s12913-022-07982-8
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background The analysis of statutory health insurance (SHI) data is a little-used approach for understanding treatment and care as well as resource use of lung cancer (LC) patients in Germany. The aims of this observational, retrospective, longitudinal analysis of structured data were to analyze the healthcare situation of LC patients in Germany based on routine data from SHI funds, to develop an algorithm that sheds light on LC types (non-small cell / NSCLC vs. small cell / SCLC), and to gain new knowledge to improve needs-based care. Methods Anonymized billing data of approximately four million people with SHI were analyzed regarding ICD-10 (German modification), documented medical interventions based on the outpatient SHI Uniform Assessment Standard Tariff (EBM) or the inpatient Operations and Procedure Code (OPS), and the dispensing of prescription drugs to outpatients (ATC classification). The study included patients who were members of 64 SHI funds between Jan-1st, 2015 and Dec-31st, 2016 and who received the initial diagnosis of LC in 2015 and 2016. Results The analysis shows that neither the cancer type nor the cancer stage can be unambiguously described by the ICD-10 coding. Furthermore, an assignment based on the prescribed medication provides only limited information: many of the drugs are either approved for both LC types or are used off-label, making it difficult to assign them to a specific LC type. Overall, 25% of the LC patients were unambiguously identifiable as NSCLC vs SCLC based on the ICD-10 code, the drug therapy, and the billing data. Conclusions The current coding system appears to be of limited suitability for drawing conclusions about LC and therefore the SHI patient population. This makes it difficult to analyze the healthcare data with the aim of gathering new knowledge to improve needs-based care. The approach chosen for this study did not allow for development of a LC differentiation algorithm based on the available healthcare data. However, a better overview of patient specific needs could make it possible to modify the range of services provided by the SHI funds. From this perspective, it makes sense, in a first step, to refine the ICD-10 system to facilitate NSCLC vs. SCLC classification.

Keywords