Frontiers in Oncology (Aug 2024)
Development of an algorithm to identify small cell lung cancer patients in claims databases
Abstract
IntroductionThe treatment landscape of small cell lung cancer (SCLC) is evolving. Evidence generated from administrative claims is needed to characterize real-world SCLC patients. However, the current ICD-10 coding system cannot distinguish SCLC from non-small cell lung cancer (NSCLC). We developed and estimated the accuracy of an algorithm to identify SCLC in claims-only databases.MethodsWe performed a cross-sectional study of lung cancer patients diagnosed from 2016-2017 using the Surveillance, Epidemiology and End Results (SEER), linked with Medicare database. The analysis included two phases – data exploration (utilizing a 25% random sample) and data validation (remaining 75% sample). The SEER definition of SCLC and NSCLC were used as the gold standard. Claims-based algorithms were identified and evaluated for their sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).ResultsThe eligible cohort included 31,912 lung cancer patients. The mean age was 76.3 years, 44.6% were male, with 9.4% having SCLC and 90.6% identified as NSCLC using SEER. The exploration analysis identified potential algorithms based on treatment data. In the validation analysis of 7,438 lung cancer patients who received systemic treatment in the outpatient setting, an etoposide-based algorithm (etoposide use in 180 days following lung cancer diagnosis) to identify SCLC showed: sensitivity 95%, specificity 95%, PPV 82% and NPV 99%.DiscussionAn etoposide treatment-based algorithm showed good accuracy in identifying SCLC patients. Such algorithms can facilitate analyses of treatment patterns, outcomes, healthcare resource and costs among treated SCLC patients, thereby bolstering the evidence-base for best patient care.
Keywords