Diagnostics (Jan 2025)

Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries

  • Robert Raché,
  • Lara-Sophie Claudé,
  • Marcus Vollmer,
  • Lyubomir Haralambiev,
  • Denis Gümbel,
  • Axel Ekkernkamp,
  • Martin Jordan,
  • Stefan Schulz-Drost,
  • Mustafa Sinan Bakir

DOI
https://doi.org/10.3390/diagnostics15020131
Journal volume & issue
Vol. 15, no. 2
p. 131

Abstract

Read online

Background/Objectives: Clavicle injuries are common and seem to be frequently subject to diagnostic misclassification. The accurate identification of clavicle fractures is essential, particularly for registry and Big Data analyses. This study aims to assess the frequency of diagnostic errors in clavicle injury classifications. Methods: This retrospective study analyzed patient data from two Level 1 trauma centers, covering the period from 2008 to 2019. Included were cases with ICD-coded diagnoses of medial, midshaft, and lateral clavicle fractures, as well as sternoclavicular and acromioclavicular joint dislocations. Radiological images were re-evaluated, and discharge summaries, radiological reports, and billing codes were examined for diagnostic accuracy. Results: A total of 1503 patients were included, accounting for 1855 initial injury diagnoses. In contrast, 1846 were detected upon review. Initially, 14.4% of cases were coded as medial clavicle fractures, whereas only 5.2% were confirmed. The misclassification rate was 82.8% for initial medial fractures (p p p p Conclusions: Our findings indicate that diagnostic misclassification of clavicle fractures is common, particularly between medial and midshaft fractures, often resulting from errors in multiple categories. Further prospective studies are needed, as accurate classification is foundational for the reliable application of Big Data and AI-based analyses in clinical research.

Keywords