Speech Enhancement using Greedy Dictionary Learning and Sparse Recovery

K. N. H. Srinivas; I. Santhi Prabha; M. Venugopala Rao

doi:10.22059/jitm.2022.89415

Journal of Information Technology Management (Jan 2023)

Affiliations

K. N. H. Srinivas: Research Scholar, ECE Department, JNTUK, Kakinada, India.
I. Santhi Prabha: Professor, ECE Department, JNTUK, Kakinada, India.
M. Venugopala Rao: Professor, ECE Department, K. L. University, Guntur, India.

Journal volume & issue: Vol. 15, Special Issue, pp. 120–132

Abstract

Most real-time speech signals are corrupted by noise such as traffic, babble, and other background sounds. The goal of speech denoising is to recover the clean speech signal while removing as many of the distorting components as possible. Many researchers have applied sparse representation and dictionary learning algorithms to speech denoising. These algorithms, however, have several disadvantages: their dictionaries are overcomplete, they are computationally expensive, they are subject to orthogonality restrictions, and they have arithmetic-precision limitations tied to their use of double-precision. We propose a greedy technique for dictionary learning with sparse representation to address these concerns. In this technique, the singular value decomposition of the input signal is used to exploit orthogonality, and the ℓ1-ℓ2 norm is employed to obtain sparsity while learning the dictionary. The technique improves dictionary learning with respect to the orthogonality constraint, the number of iterations (based on the three-sigma rule), and the overcomplete nature of the dictionary. It yields improved performance as well as reduced computational complexity. Using Q7 fixed-point arithmetic, the approach is also applied to resource-constrained embedded systems, where its performance is considerably better than that of other algorithms. The greedy approach outperforms the other two algorithms in terms of SNR, Short-Time Objective Intelligibility, and computing time.
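To make the Q7 fixed-point and SNR terms in the abstract concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the conventional reading of Q7 (an 8-bit signed integer with 7 fractional bits, i.e. a scale factor of 128) and the usual energy-ratio definition of SNR in decibels; the abstract does not spell out either. The function names `to_q7`, `from_q7`, and `snr_db`, as well as the synthetic 440 Hz test tone, are hypothetical and chosen only for illustration.

```python
import numpy as np


def to_q7(x):
    """Quantize floats in [-1, 1) to Q7 (assumed: int8 with 7 fractional bits)."""
    scaled = np.round(np.asarray(x, dtype=np.float64) * 128.0)
    # Saturate instead of wrapping around, as fixed-point DSP code usually does.
    return np.clip(scaled, -128, 127).astype(np.int8)


def from_q7(q):
    """Map Q7 integers back to floating-point values."""
    return np.asarray(q).astype(np.float64) / 128.0


def snr_db(clean, estimate):
    """SNR in dB: energy of the clean signal over energy of the error."""
    clean = np.asarray(clean, dtype=np.float64)
    error = clean - np.asarray(estimate, dtype=np.float64)
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum(error ** 2))


# Hypothetical test signal: 0.1 s of a 440 Hz tone sampled at 8 kHz.
t = np.arange(800) / 8000.0
clean = 0.5 * np.sin(2.0 * np.pi * 440.0 * t)

# Round trip through Q7 and measure how much precision is lost.
restored = from_q7(to_q7(clean))
q7_snr = snr_db(clean, restored)
print(f"SNR after Q7 round trip: {q7_snr:.1f} dB")
```

The round trip isolates the error introduced by the 8-bit representation alone, which is the kind of precision budget an embedded fixed-point implementation has to work within before any denoising is applied.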

Published in Journal of Information Technology Management

ISSN: 2008-5893 (Print); 2423-5059 (Online)
Publisher: University of Tehran
Country of publisher: Iran, Islamic Republic of
LCC subjects: Bibliography. Library science. Information resources: Information resources (General)
Website: https://jitm.ut.ac.ir/
