Scientific Reports (Oct 2022)

Automation of generative adversarial network-based synthetic data-augmentation for maximizing the diagnostic performance with paranasal imaging

  • Hyoun-Joong Kong,
  • Jin Youp Kim,
  • Hye-Min Moon,
  • Hae Chan Park,
  • Jeong-Whun Kim,
  • Ruth Lim,
  • Jonghye Woo,
  • Georges El Fakhri,
  • Dae Woo Kim,
  • Sungwan Kim

DOI
https://doi.org/10.1038/s41598-022-22222-z
Journal volume & issue
Vol. 12, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Thus far, there have been no reported specific rules for systematically determining the appropriate augmented sample size to optimize model performance when conducting data augmentation. In this paper, we report on the feasibility of synthetic data augmentation using generative adversarial networks (GAN) by proposing an automation pipeline to find the optimal multiple of data augmentation to achieve the best deep learning-based diagnostic performance in a limited dataset. We used Waters’ view radiographs for patients diagnosed with chronic sinusitis to demonstrate the method developed herein. We demonstrate that our approach produces significantly better diagnostic performance parameters than models trained using conventional data augmentation. The deep learning method proposed in this study could be implemented to assist radiologists in improving their diagnosis. Researchers and industry workers could overcome the lack of training data by employing our proposed automation pipeline approach in GAN-based synthetic data augmentation. This is anticipated to provide new means to overcome the shortage of graphic data for algorithm training.