Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization

Elisa Tengan; Thomas Dietzen; Filip Elvander; Toon van Waterschoot

doi:10.1186/s13636-023-00304-8

EURASIP Journal on Audio, Speech, and Music Processing (Oct 2023)

Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization

Elisa Tengan,
Thomas Dietzen,
Filip Elvander,
Toon van Waterschoot

Affiliations

Elisa Tengan: Department of Electrical Engineering (ESAT/STADIUS), KU Leuven
Thomas Dietzen: Department of Electrical Engineering (ESAT/STADIUS), KU Leuven
Filip Elvander: Department of Information and Communications Engineering, Aalto University
Toon van Waterschoot: Department of Electrical Engineering (ESAT/STADIUS), KU Leuven

DOI: https://doi.org/10.1186/s13636-023-00304-8
Journal volume & issue: Vol. 2023, no. 1
pp. 1 – 20

Abstract

Read online

Abstract In this paper, two approaches are proposed for estimating the direction of arrival (DOA) and power spectral density (PSD) of stationary point sources by using a single, rotating, directional microphone. These approaches are based on a method previously presented by the authors, in which point source DOAs were estimated by using a broadband signal model and solving a group-sparse optimization problem, where the number of observations made by the rotating directional microphone can be lower than the number of candidate DOAs in an angular grid. The DOA estimation is followed by the estimation of the sources’ PSDs through the solution of an overdetermined least squares problem. The first approach proposed in this paper includes the use of an additional nonnegativity constraint on the residual noise term when solving the group-sparse optimization problem and is referred to as the Group Lasso Least Squares (GL-LS) approach. The second proposed approach, in addition to the new nonnegativity constraint, employs a narrowband signal model when building the linear system of equations used for formulating the group-sparse optimization problem, where the DOAs and PSDs can be jointly estimated by iterative, group-wise reweighting. This is referred to as the Group-Lasso with $$l_1$$ l 1 -reweighting (GL-L1) approach. Both proposed approaches are implemented using the alternating direction method of multipliers (ADMM), and their performance is evaluated through simulations in which different setup conditions are considered, ranging from different types of model mismatch to variations in the acoustic scene and microphone directivity pattern. The results obtained show that in a scenario involving a microphone response mismatch between observed data and the signal model used, having the additional nonnegativity constraint on the residual noise can improve the DOA estimation for the case of GL-LS and the PSD estimation for the case of GL-L1. Moreover, the GL-L1 approach can present an advantage over GL-LS in terms of DOA estimation performance in scenarios with low SNR or where multiple sources are closely located to each other. Finally, it is shown that having the least squares PSD re-estimation step is beneficial in most scenarios, such that GL-LS outperformed GL-L1 in terms of PSD estimation errors.

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords