Advances in Sciences and Technology (Mar 2018)

Advanced Time-Frequency Representation in Voice Signal Analysis

  • Dariusz Mika,
  • Jerzy Józwik

DOI
https://doi.org/10.12913/22998624/87028
Journal volume & issue
Vol. 12, no. 1
pp. 251 – 259

Abstract

The spectrogram is the most commonly used time-frequency representation in voice signal analysis. It belongs to Cohen's class, the class of time-frequency energy distributions. In terms of resolution, however, the spectrogram is not optimal, and Cohen's class contains representations with better resolution properties. All of them are obtained by smoothing the Wigner-Ville distribution (WVD), which offers the best resolution but also the strongest interference terms. The choice of smoothing function determines the compromise between resolution and suppression of the interference terms. Another class of time-frequency energy distributions is the affine class. In terms of readability, the best properties are offered by reassigned distributions, obtained by applying a general methodology known as reassignment, which redistributes the energy of any time-frequency representation. Reassigned distributions efficiently combine the reduction of interference terms provided by a well-adapted smoothing kernel with an increased concentration of the signal components.
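The trade-off described above can be illustrated with a minimal sketch. It is not taken from the paper: the two-tone test signal, the sampling rate, and the helper name `wigner_ville` are assumptions chosen for illustration. The sketch computes an unsmoothed discrete WVD and a Hann-window spectrogram of the same signal. The WVD shows an oscillating interference term midway between the two tones, while the spectrogram stays non-negative and shows no such term.

```python
import numpy as np
from scipy.signal import hilbert, spectrogram


def wigner_ville(x):
    """Unsmoothed discrete Wigner-Ville distribution (illustrative helper).

    Uses the analytic signal of x. Returns an array of shape
    (len(x), len(x)) indexed as [frequency_bin, time_sample].
    Frequency bin k corresponds to k * fs / (2 * len(x)) Hz, because the
    product z[t + lag] * conj(z[t - lag]) spans an effective lag of 2 * lag.
    """
    z = hilbert(x)
    n = len(z)
    wvd = np.zeros((n, n))
    for t in range(n):
        # Largest symmetric lag that stays inside the signal.
        max_lag = min(t, n - 1 - t)
        lags = np.arange(-max_lag, max_lag + 1)
        kernel = np.zeros(n, dtype=complex)
        kernel[lags % n] = z[t + lags] * np.conj(z[t - lags])
        # The kernel is Hermitian in the lag, so its FFT is real.
        wvd[:, t] = np.fft.fft(kernel).real
    return wvd


# Assumed test signal: two stationary tones at 128 Hz and 384 Hz.
fs = 1024
n = 256
time = np.arange(n) / fs
x = np.cos(2 * np.pi * 128 * time) + np.cos(2 * np.pi * 384 * time)

wvd = wigner_ville(x)

# Spectrogram of the same signal (Hann window, 64-sample segments).
f, t_spec, sxx = spectrogram(x, fs=fs, window="hann", nperseg=64, noverlap=48)

# WVD bins: 128 Hz -> 64, 384 Hz -> 192, midpoint 256 Hz -> 128.
print("WVD auto-term  (128 Hz, t=128):", wvd[64, 128])
print("WVD cross-term (256 Hz, t=128):", wvd[128, 128])
print("WVD cross-term (256 Hz, t=130):", wvd[128, 130])
# Spectrogram rows: 128 Hz -> 8, 256 Hz -> 16 (16 Hz per bin).
print("Spectrogram mean at 128 Hz:", sxx[8].mean())
print("Spectrogram mean at 256 Hz:", sxx[16].mean())
```

In this sketch the cross-term at 256 Hz changes sign over time and exceeds the auto-term in magnitude, which is the interference that smoothing kernels in Cohen's class are meant to attenuate, at the cost of resolution.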

Keywords