Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method

Mark D. Fletcher; Samuel W. Perry; Iordanis Thoidis; Carl A. Verschuur; Tobias Goehring

doi:10.1038/s41598-024-57312-7

Scientific Reports (Mar 2024)

Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method

Mark D. Fletcher,
Samuel W. Perry,
Iordanis Thoidis,
Carl A. Verschuur,
Tobias Goehring

Affiliations

Mark D. Fletcher: University of Southampton Auditory Implant Service, University of Southampton
Samuel W. Perry: University of Southampton Auditory Implant Service, University of Southampton
Iordanis Thoidis: School of Electrical and Computer Engineering, Aristotle University of Thessaloniki
Carl A. Verschuur: University of Southampton Auditory Implant Service, University of Southampton
Tobias Goehring: MRC Cognition and Brain Sciences Unit, University of Cambridge

DOI: https://doi.org/10.1038/s41598-024-57312-7
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Many people with hearing loss struggle to understand speech in noisy environments, making noise robustness critical for hearing-assistive devices. Recently developed haptic hearing aids, which convert audio to vibration, can improve speech-in-noise performance for cochlear implant (CI) users and assist those unable to access hearing-assistive devices. They are typically body-worn rather than head-mounted, allowing additional space for batteries and microprocessors, and so can deploy more sophisticated noise-reduction techniques. The current study assessed whether a real-time-feasible dual-path recurrent neural network (DPRNN) can improve tactile speech-in-noise performance. Audio was converted to vibration on the wrist using a vocoder method, either with or without noise reduction. Performance was tested for speech in a multi-talker noise (recorded at a party) with a 2.5-dB signal-to-noise ratio. An objective assessment showed the DPRNN improved the scale-invariant signal-to-distortion ratio by 8.6 dB and substantially outperformed traditional noise-reduction (log-MMSE). A behavioural assessment in 16 participants showed the DPRNN improved tactile-only sentence identification in noise by 8.2%. This suggests that advanced techniques like the DPRNN could substantially improve outcomes with haptic hearing aids. Low-cost haptic devices could soon be an important supplement to hearing-assistive devices such as CIs or offer an alternative for people who cannot access CI technology.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal