Efficiently Classifying Lung Sounds through Depthwise Separable CNN Models with Fused STFT and MFCC Features

Shing-Yun Jung; Chia-Hung Liao; Yu-Sheng Wu; Shyan-Ming Yuan; Chuen-Tsai Sun

doi:10.3390/diagnostics11040732

Diagnostics (Apr 2021)

Affiliations

Shing-Yun Jung: Department of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan
Chia-Hung Liao: Department of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan
Yu-Sheng Wu: Department of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan
Shyan-Ming Yuan: Department of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan
Chuen-Tsai Sun: Department of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan

DOI: https://doi.org/10.3390/diagnostics11040732
Journal volume & issue: Vol. 11, no. 4, p. 732

Abstract

Lung sounds remain vital in clinical diagnosis because they are associated with pulmonary pathologies. With COVID-19 spreading across the world, it has become more pressing for medical professionals to leverage artificial intelligence for faster and more accurate lung auscultation. This research proposes a feature engineering process that extracts dedicated features for a depthwise separable convolutional neural network (DS-CNN) to classify lung sounds accurately and efficiently. We extracted three features for the shrunk DS-CNN model: the short-time Fourier-transformed (STFT) feature, the Mel-frequency cepstrum coefficient (MFCC) feature, and the fusion of these two features. DS-CNN models trained on the STFT feature alone or the MFCC feature alone achieved accuracies of 82.27% and 73.02%, respectively, whereas fusing both features led to a higher accuracy of 85.74%. In addition, our method achieved 16 times higher inference speed on an edge device than RespireNet, with only 0.45% lower accuracy. These findings indicate that combining fused STFT and MFCC features with a DS-CNN is a suitable model design for lightweight edge devices to achieve accurate AI-aided detection of lung diseases.
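
The efficiency argument for a DS-CNN can be illustrated with a short weight count. The sketch below compares a standard convolution layer with a depthwise separable one (a depthwise k x k convolution followed by a 1 x 1 pointwise convolution). The layer sizes are hypothetical values chosen for illustration and are not taken from the paper; the resulting ratio describes weights in a single layer only and is not the 16 times inference speedup reported above.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k, c_in, c_out):
    """Weights in a depthwise k x k conv plus a 1 x 1 pointwise conv (bias omitted)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, assumed for illustration only (not from the paper).
k, c_in, c_out = 3, 64, 128

std = standard_conv_params(k, c_in, c_out)
dsc = depthwise_separable_params(k, c_in, c_out)

print("standard conv weights:      ", std)
print("depthwise separable weights:", dsc)
print("reduction factor:           ", round(std / dsc, 2))
```

In general the ratio of depthwise separable to standard weights is 1/c_out + 1/k^2, so the saving grows with the number of output channels and the kernel size.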

Published in Diagnostics

ISSN: 2075-4418 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.mdpi.com/journal/diagnostics
