Applied Sciences (Jul 2022)

A Progressively Expanded Database for Automated Lung Sound Analysis: An Update

  • Fu-Shun Hsu,
  • Shang-Ran Huang,
  • Chien-Wen Huang,
  • Yuan-Ren Cheng,
  • Chun-Chieh Chen,
  • Jack Hsiao,
  • Chung-Wei Chen,
  • Feipei Lai

DOI
https://doi.org/10.3390/app12157623
Journal volume & issue
Vol. 12, no. 15
p. 7623

Abstract

Read online

We previously established an open-access lung sound database, HF_Lung_V1, and developed deep learning models for inhalation, exhalation, continuous adventitious sound (CAS), and discontinuous adventitious sound (DAS) detection. The amount of data used for training contributes to model accuracy. In this study, we collected larger quantities of data to further improve model performance and explored issues of noisy labels and overlapping sounds. HF_Lung_V1 was expanded to HF_Lung_V2 with a 1.43× increase in the number of audio files. Convolutional neural network–bidirectional gated recurrent unit network models were trained separately using the HF_Lung_V1 (V1_Train) and HF_Lung_V2 (V2_Train) training sets. These were tested using the HF_Lung_V1 (V1_Test) and HF_Lung_V2 (V2_Test) test sets, respectively. Segment and event detection performance was evaluated. Label quality was assessed. Overlap ratios were computed between inhalation, exhalation, CAS, and DAS labels. The model trained using V2_Train exhibited improved performance in inhalation, exhalation, CAS, and DAS detection on both V1_Test and V2_Test. Poor CAS detection was attributed to the quality of CAS labels. DAS detection was strongly influenced by the overlapping of DAS with inhalation and exhalation. In conclusion, collecting greater quantities of lung sound data is vital for developing more accurate lung sound analysis models.

Keywords