Frontiers in Bioengineering and Biotechnology (Jan 2016)

Voice pathology detection using modulation spectrum optimized metrics

  • Laureano eMoro-Velazquez,
  • Jorge Andrés Gómez-García,
  • Juan Ignacio Godino-Llorente

DOI
https://doi.org/10.3389/fbioe.2016.00001
Journal volume & issue
Vol. 4

Abstract

Read online

There exist many acoustic parameters employed for pathological assessment tasks, which have served as tools for clinicians to distinguish between normophonic and pathological voices. However, many of these parameters require an appropriate tuning in order to maximize its efficiency.In this work, a group of new and already proposed modulation spectrum metrics are optimized considering different time and frequency ranges pursuing the maximization of efficiency for the detection of pathological voices.The optimization of the metrics is performed simultaneously in two different voice databases in order to identify what tuning ranges produce a better generalization. The experiments were cross-validated so as to ensure the validity of the results. A third database is used to test the optimized metrics.In spite of some differences, results indicate that the behavior of the metrics in the optimization process follows similar tendencies for the tuning databases, confirming the generalization capabilities of the proposed modulation spectrum metrics. In addition, the tuning process reveals which bands of the modulation spectra have relevant information for each metric, which has a physical interpretation respecting the phonatory system. Efficiency values up to 90.6 % are obtained in one tuning database while in the other the maximum efficiency reaches 71.1 %.Obtained results also evidence a separability between normophonic and pathological states using the proposed metrics, which can be exploited for voice pathology detection or assessment.

Keywords