PLoS ONE (Jan 2024)

CNNs trained with adult data are useful in pediatrics. A pneumonia classification example.

  • Maria Rollan-Martinez-Herrera,
  • Alejandro A Díaz,
  • Rubén San José Estépar,
  • Gonzalo Vegas Sanchez-Ferrero,
  • James C Ross,
  • Raúl San José Estépar,
  • Pietro Nardelli

DOI
https://doi.org/10.1371/journal.pone.0306703
Journal volume & issue
Vol. 19, no. 7
p. e0306703

Abstract

Read online

Background and objectivesThe scarcity of data for training deep learning models in pediatrics has prompted questions about the feasibility of employing CNNs trained with adult images for pediatric populations. In this work, a pneumonia classification CNN was used as an exploratory example to showcase the adaptability and efficacy of such models in pediatric healthcare settings despite the inherent data constraints.MethodsTo develop a curated training dataset with reduced biases, 46,947 chest X-ray images from various adult datasets were meticulously selected. Two preprocessing approaches were tried to assess the impact of thoracic segmentation on model attention outside the thoracic area. Evaluation of our approach was carried out on a dataset containing 5,856 chest X-rays of children from 1 to 5 years old.ResultsAn analysis of attention maps indicated that networks trained with thorax segmentation placed less attention on regions outside the thorax, thus eliminating potential bias. The ensuing network exhibited impressive performance when evaluated on an adult dataset, achieving a pneumonia discrimination AUC of 0.95. When tested on a pediatric dataset, the pneumonia discrimination AUC reached 0.82.ConclusionsThe results of this study show that adult-trained CNNs can be effectively applied to pediatric populations. This could potentially shift focus towards validating adult models over pediatric population instead of training new CNNs with limited pediatric data. To ensure the generalizability of deep learning models, it is important to implement techniques aimed at minimizing biases, such as image segmentation or low-quality image exclusion.