IET Communications (Jun 2021)

Multi‐channel underdetermined blind source separation for recorded audio mixture signals using an unmanned aerial vehicle

  • Kan Xie,
  • Kanyang Jiang,
  • Qiyu Yang

DOI
https://doi.org/10.1049/cmu2.12109
Journal volume & issue
Vol. 15, no. 10
pp. 1412 – 1422

Abstract

Read online

Abstract Unmanned aerial vehicles as an important role for 5G and beyond networks are becoming more and more popular and have been equipped with various sensors to enable diverse emerging applications, e.g. locating sound‐emitting targets. Multi‐channel blind source separation algorithm has been applied into the unmanned aerial vehicles and micro aerial vehicles, where underdetermined mixture blind source separation is a challenging problem, i.e. the number of sources is more than the number of microphones. An optimization underdetermined blind source separation algorithm to separate the multi‐channel audio mixture signals recorded by an unmanned aerial vehicle is proposed. In the algorithm, firstly a hierarchical clustering to estimate channel as the mixing matrix initialization is employed, while using direction of arrival permutation algorithm to deal with the permutation alignment and update the mixing matrix using multiplication update method. Then the model parameters are estimated using improved expectation‐maximization update rules for the fast convergence. Finally, the frequency‐domain sources are estimated through Wiener filtering and time‐domain sources are obtained via inverse short‐time Fourier transform. Experimental results covering synthetic and real‐recorded speech source mixtures show that the proposed algorithm achieves better separation results than the state‐of‐the‐art methods.

Keywords