Single-channel speech enhancement based on joint constrained dictionary learning

Linhui Sun; Yunyi Bu; Pingan Li; Zihao Wu

doi:10.1186/s13636-021-00218-3

EURASIP Journal on Audio, Speech, and Music Processing (Jul 2021)

Single-channel speech enhancement based on joint constrained dictionary learning

Linhui Sun,
Yunyi Bu,
Pingan Li,
Zihao Wu

Affiliations

Linhui Sun: College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications
Yunyi Bu: College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications
Pingan Li: College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications
Zihao Wu: College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications

DOI: https://doi.org/10.1186/s13636-021-00218-3
Journal volume & issue: Vol. 2021, no. 1
pp. 1 – 14

Abstract

Read online

Abstract To improve the performance of speech enhancement in a complex noise environment, a joint constrained dictionary learning method for single-channel speech enhancement is proposed, which solves the “cross projection” problem of signals in the joint dictionary. In the method, the new optimization function not only constrains the sparse representation of the noisy signal in the joint dictionary, and controls the projection error of the speech signal and noise signal on the corresponding sub-dictionary, but also minimizes the cross projection error and the correlation between the sub-dictionaries. In addition, the adjustment factors are introduced to balance the weight of constraint terms to obtain the joint dictionary more discriminatively. When the method is applied to the single-channel speech enhancement, speech components of the noisy signal can be more projected onto the clean speech sub-dictionary of the joint dictionary without being affected by the noise sub-dictionary, which makes the quality and intelligibility of the enhanced speech higher. The experimental results verify that our algorithm has better performance than the speech enhancement algorithm based on discriminative dictionary learning under white noise and colored noise environments in time domain waveform, spectrogram, global signal-to-noise ratio, subjective evaluation of speech quality, and logarithmic spectrum distance.

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords