Hardware Efficient Speech Enhancement With Noise Aware Multi-Target Deep Learning

Salinna Abdullah; Majid Zamani; Andreas Demosthenous

doi:10.1109/OJCAS.2024.3389100

IEEE Open Journal of Circuits and Systems (Jan 2024)

Hardware Efficient Speech Enhancement With Noise Aware Multi-Target Deep Learning

Salinna Abdullah,
Majid Zamani,
Andreas Demosthenous

Affiliations

Salinna Abdullah: ORCiD; Department of Electronic and Electrical Engineering, University College London, London, U.K.
Majid Zamani: Department of Electronic and Electrical Engineering, University College London, London, U.K.
Andreas Demosthenous: ORCiD; Department of Electronic and Electrical Engineering, University College London, London, U.K.

DOI: https://doi.org/10.1109/OJCAS.2024.3389100
Journal volume & issue: Vol. 5
pp. 141 – 152

Abstract

Read online

This paper describes a supervised speech enhancement (SE) method utilising a noise-aware four-layer deep neural network and training target switching. For optimal speech denoising, the SE system, trained with multiple-target joint learning, switches between mapping-based, masking-based, or complementary processing, depending on the level of noise contamination detected. Optimisation techniques, including ternary quantisation, structural pruning, efficient sparse matrix representation and cost-effective approximations for complex computations, were implemented to reduce area, memory, and power requirements. Up to 19.1x compression was obtained, and all weights could be stored on the on-chip memory. When processing NOISEX-92 noises, the system achieved an average short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) scores of 0.81 and 1.62, respectively, outperforming SE algorithms trained with only a single learning target. The proposed SE processor was implemented on a field programmable gate array (FPGA) for proof of concept. Mapping the design on a 65-nm CMOS process led to a chip core area of $3.88~mm^{2}$ and a power consumption of 1.91 mW when operating at a 10 MHz clock frequency.

Published in IEEE Open Journal of Circuits and Systems

ISSN: 2644-1225 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electric apparatus and materials. Electric circuits. Electric networks
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8784029

About the journal

Abstract

Keywords