Incoherent Discriminative Dictionary Learning for Speech Enhancement

Dima Shaheen; Oumayma Al Dakkak; Mohiedin Wainakh

doi:10.26636/jtit.2018.121317

Journal of Telecommunications and Information Technology (Sep 2018)

Incoherent Discriminative Dictionary Learning for Speech Enhancement

Dima Shaheen,
Oumayma Al Dakkak ,
Mohiedin Wainakh

Affiliations

Dima Shaheen
Oumayma Al Dakkak
Mohiedin Wainakh

DOI: https://doi.org/10.26636/jtit.2018.121317
Journal volume & issue: no. 3

Abstract

Read online

Speech enhancement is one of the many challenging tasks in signal processing, especially in the case of nonstationary speech-like noise. In this paper a new incoherent discriminative dictionary learning algorithm is proposed to model both speech and noise, where the cost function accounts for both “source confusion” and “source distortion” errors, with a regularization term that penalizes the coherence between speech and noise sub-dictionaries. At the enhancement stage, we use sparse coding on the learnt dictionary to ﬁnd an estimate for both clean speech and noise amplitude spectrum. In the ﬁnal phase, the Wiener ﬁlter is used to reﬁne the clean speech estimate. Experiments on the Noizeus dataset, using two objective speech enhancement measures: frequency-weighted segmental SNR and Perceptual Evaluation of Speech Quality (PESQ) demonstrate that the proposed algorithm outperforms other speech enhancement methods tested.

Published in Journal of Telecommunications and Information Technology

ISSN: 1509-4553 (Print); 1899-8852 (Online)
Publisher: National Institute of Telecommunications
Country of publisher: Poland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://jtit.pl

About the journal

Abstract

Keywords