Publication:
Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence

Loading...
Thumbnail Image
Identifiers
Publication date
2019-06-04
Defense date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
MDPI
Impact
Google Scholar
Export
Research Projects
Organizational Units
Journal Issue
Abstract
A Speaker Identification system for a personalized wearable device to combat gender-based violence is presented in this paper. Speaker recognition systems exhibit a decrease in performance when the user is under emotional or stress conditions, thus the objective of this paper is to measure the effects of stress in speech to ultimately try to mitigate their consequences on a speaker identification task, by using data augmentation techniques specifically tailored for this purpose given the lack of data resources for this condition. An extensive experimentation has been carried out for assessing the effectiveness of the proposed techniques. First, we conclude that the best performance is always obtained when naturally stressed samples are included in the training set, and second, when these are not available, their substitution and augmentation with synthetically generated stress-like samples improves the performance of the system.
Description
This article belongs to the Special Issue IberSPEECH 2018: Speech and Language Technologies for Iberian Languages
Keywords
Speaker identification, Emotions, Stress conditions, Data augmentation, Synthetic stress
Bibliographic citation
Rituerto-González, E., Mínguez-Sánchez, A., Gallardo-Antolín, A. y Peláez-Moreno, C. (2019). Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence. Applied Sciences, 9(11), 2298.