Frontiers in Medicine (Sep 2025)

Severity assessment of COVID-19 disease: radiological visual score versus automated quantitative CT parameters using a pneumonia analysis algorithm

  • Dario Baldi,
  • Bruna Punzo,
  • Andrea Colacino,
  • Marco Aiello,
  • Monica Franzese,
  • Ludovico La Grutta,
  • Luca Saba,
  • Eduardo Bossone,
  • Emanuela Passaro,
  • Cesare Mantini,
  • Carlo Cavaliere,
  • Sergio Berti,
  • Simona Celi,
  • Antonella Meloni,
  • Filippo Cademartiri,
  • Erica Maffei

DOI
https://doi.org/10.3389/fmed.2025.1606771
Journal volume & issue
Vol. 12

Abstract

Read online

ObjectiveEvaluate the impact of the use of an AI-assisted quantitative tool for assessing stratification of patients with acute lung involvement from coronavirus (COVID-19) compared to a semi-quantitative visual score made by the radiologist.MethodsWe retrospectively enrolled 611 patients with respiratory distress and suspected pneumonia admitted between 27 February and 27 April 2020. Demographic, imaging, and clinical data were collected. Lung involvement was visually assessed using a 5-class severity scale and compared with automated AI-based CT analysis (CT Pneumonia Analysis 2.5.2, SyngoVIA Siemens), which quantified volume and density of alterations. Patients were assigned to severity classes for concordance analysis. Subgroup analysis across biweekly periods assessed changes in visual rater performance. Correlation with SpO₂ and diagnostic performance (accuracy, sensitivity, specificity, AUC) of both methods in predicting RT-PCR results were evaluated.ResultsHigh concordance was found between visual and quantitative assessments (k = 0.73, p < 0.001), with most discordances in low-severity (classes 0–1, k = 0.71), while agreement was excellent for higher severity (classes 2–4, k = 0.91). Misclassifications were mainly for mild cases; concordance was strong in severe, life-threatening presentations. Temporal analysis showed a progressive improvement in agreement over time (k = 0.62, 0.61, 0.54, 0.73). A mild but significant negative correlation emerged between quantitative assessment and SpO₂ values (r = −0.13, p = 0.02). Diagnostic performance between the two methods was similar: visual (AUC = 0.55, Accuracy = 44%, Sensitivity = 27%, Specificity = 78%) and quantitative (AUC = 0.56, Accuracy = 45%, Sensitivity = 27%, Specificity = 79%). Neither method showed strong predictive power for RT-PCR COVID-19 positivity. Nonetheless, assessing lung involvement remains essential for managing respiratory distress, regardless of confirmed infection, particularly for identifying patients with >25% parenchymal involvement who may require hospitalization.ConclusionVisual and AI-based CT assessments showed high concordance and similar accuracy, especially in patients with >25% lung involvement. This study demonstrates the utility of AI-based algorithms to improve the diagnostic efficiency and the reliability, highlighting their value in routine COVID-19 pneumonia evaluation and management.

Keywords