BMJ Open (Feb 2024)

Feasibility of extracting cancer stage and metastasis codes from health insurance claims of outpatients and expressibility in ICD-11: a cross-sectional study using national health insurance data from South Korea

  • Chris Lane,
  • Young-Taek Park,
  • Dongwoon Han,
  • Kyoung-Hoon Kim,
  • Hoguen Kim,
  • Hojung Joseph Yoon,
  • Byeo-Ri Kim,
  • Joo-Yeon Jeong

DOI
https://doi.org/10.1136/bmjopen-2023-073952
Journal volume & issue
Vol. 14, no. 2

Abstract

Read online

Objectives This study aimed to evaluate the incidence of health insurance claims recording the cancer stage and TNM codes representing tumor extension size (T), lymph node metastasis (N), and distant metastasis (M) for patients diagnosed with cancer and to determine whether this extracted data could be applied to the new ICD-11 codes.Design A cross-sectional study design was used, with the units of analysis as individual outpatients. Two dependent variables were extraction feasibility of cancer stage and TNM metastasis information from each claim. Expressibility of the two variables in ICD-11 was descriptively analysed.Setting and participants The study was conducted in South Korea and study participants were outpatients: lung cancer (LC) (46616), stomach cancer (SC) (50103) and colorectal cancer (CC) (54707). The data set consisted of the first health insurance claim of each patient visiting a hospital from 1 July to 31 December 2021.Results The absolute extraction success rates for cancer stage based on claims with cancer stage was 33.3%. The rates for stage for LC, SC and CC were 30.1%, 35.5% and 34.0%, respectively. The rate for TNM was 11.0%. The relative extraction success rates for stage compared with that for CC (the reference group) were lower for patients with LC (adjusted OR (aOR), 0.803; 95% CI 0.782 to 0.825; p<0.0001) but higher for SC (aOR 1.073; 95% CI 1.046 to 1.101; p<0.0001). The rates of TNM compared that for CC were 40.7% lower for LC (aOR, 0.593; 95% CI 0.569 to 0.617; p<0.0001) and 43.0% lower for SC (aOR 0.570; 95% CI 0.548 to 0.593; p<0.0001). There were limits to expressibility in ICD-11 regarding the detailed cancer stage and TNM metastasis codes.Conclusion Extracting cancer stage and TNM codes from health insurance claims were feasible, but expressibility in ICD-11 codes was limited. WHO may need to create specific cancer stage and TNM extension codes for ICD-11 due to the absence of current rules in ICD-11.