Laryngeal Pressure Estimation With a Recurrent Neural Network

Pablo Gomez; Anne Schutzenberger; Marion Semmler; Michael Dollinger

doi:10.1109/JTEHM.2018.2886021

IEEE Journal of Translational Engineering in Health and Medicine (Jan 2019)

Laryngeal Pressure Estimation With a Recurrent Neural Network

Pablo Gomez,
Anne Schutzenberger,
Marion Semmler,
Michael Dollinger

Affiliations

Pablo Gomez: ORCiD; Department of Otorhinolaryngology, Head and Neck Surgery, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
Anne Schutzenberger: Department of Otorhinolaryngology, Head and Neck Surgery, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
Marion Semmler: Department of Otorhinolaryngology, Head and Neck Surgery, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
Michael Dollinger: Department of Otorhinolaryngology, Head and Neck Surgery, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany

DOI: https://doi.org/10.1109/JTEHM.2018.2886021
Journal volume & issue: Vol. 7
pp. 1 – 11

Abstract

Read online

Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed ex vivo video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.

Published in IEEE Journal of Translational Engineering in Health and Medicine

ISSN: 2168-2372 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Medicine (General): Medical technology
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6221039

About the journal

Abstract

Keywords