Applied Sciences (Jun 2019)

Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence

  • Esther Rituerto-González,
  • Alba Mínguez-Sánchez,
  • Ascensión Gallardo-Antolín,
  • Carmen Peláez-Moreno

DOI
https://doi.org/10.3390/app9112298
Journal volume & issue
Vol. 9, no. 11
p. 2298

Abstract

Read online

A Speaker Identification system for a personalized wearable device to combat gender-based violence is presented in this paper. Speaker recognition systems exhibit a decrease in performance when the user is under emotional or stress conditions, thus the objective of this paper is to measure the effects of stress in speech to ultimately try to mitigate their consequences on a speaker identification task, by using data augmentation techniques specifically tailored for this purpose given the lack of data resources for this condition. An extensive experimentation has been carried out for assessing the effectiveness of the proposed techniques. First, we conclude that the best performance is always obtained when naturally stressed samples are included in the training set, and second, when these are not available, their substitution and augmentation with synthetically generated stress-like samples improves the performance of the system.

Keywords