Speaker recognition under stress conditions

Rituerto González, Esther; Gallardo Antolín, Ascensión; Peláez Moreno, Carmen

Publication:
Speaker recognition under stress conditions

dc.affiliation.dpto	UC3M. Departamento de Teoría de la Señal y Comunicaciones	es
dc.affiliation.grupoinv	UC3M. Grupo de Investigación: Procesado Multimedia	es
dc.contributor.author	Rituerto González, Esther
dc.contributor.author	Gallardo Antolín, Ascensión
dc.contributor.author	Peláez Moreno, Carmen
dc.contributor.funder	Ministerio de Economía y Competitividad (España)	es
dc.date.accessioned	2019-10-16T10:29:23Z
dc.date.available	2019-10-16T10:29:23Z
dc.date.issued	2018-11
dc.description	Proceeding of: IberSPEECH 2018, 21-23 November 2018, Barcelona, Spain	en
dc.description.abstract	Speaker recognition systems exhibit a decrease in performance when the input speech is not in optimal circumstances, for example when the user is under emotional or stress conditions. The objective of this paper is measuring the effects of stress on speech to ultimately try to mitigate its consequences on a speaker recognition task. On this paper, we develop a stress-robust speaker identification system using data selection and augmentation by means of the manipulation of the original speech utterances. An extensive experimentation has been carried out for assessing the effectiveness of the proposed techniques. First, we concluded that the best performance is always obtained when naturally stressed samples are included in the training set, and second, when these are not available, their substitution and augmentation with synthetically generated stress-like samples, improves the performance of the system.	en
dc.description.sponsorship	This work is partially supported by the Spanish Government-MinECo projects TEC2014-53390-P and TEC2017-84395-P.	en
dc.format.extent	5	es
dc.identifier.bibliographicCitation	Proceedings of IberSPEECH 2018, Pp. 15-19	en
dc.identifier.doi	https://doi.org/10.21437/IberSPEECH.2018-4
dc.identifier.publicationfirstpage	15	es
dc.identifier.publicationlastpage	19	es
dc.identifier.publicationtitle	Proceedings of IberSPEECH 2018, 21-23 November 2018, Barcelona, Spain	es
dc.identifier.uri	https://hdl.handle.net/10016/29035
dc.identifier.uxxi	CC/0000028620
dc.language.iso	eng	es
dc.relation.eventdate	2018-11-21	es
dc.relation.eventplace	BARCELONA	es
dc.relation.eventtitle	IberSPEECH 2018	es
dc.relation.projectID	Gobierno de España. TEC2014-53390-P	es
dc.relation.projectID	Gobierno de España. TEC2017-84395-P	es
dc.rights.accessRights	open access	es
dc.subject.eciencia	Electrónica	es
dc.subject.eciencia	Telecomunicaciones	es
dc.subject.other	Speaker recognition	en
dc.subject.other	Speaker identification	en
dc.subject.other	Emotions	en
dc.subject.other	Stress conditions	en
dc.subject.other	Data augmentation	en
dc.subject.other	Synthetic stress	en
dc.title	Speaker recognition under stress conditions	en
dc.type	conference paper	*
dc.type.hasVersion	VoR	*
dspace.entity.type	Publication

Files

Original bundle

Now showing 1 - 1 of 1

Name:: speaker_IBERSPEECH_2018.pdf
Size:: 130.55 KB
Format:: Adobe Portable Document Format

Download

Collections

DTSC - GPM - Comunicaciones en congresos y otros eventos

Publication: Speaker recognition under stress conditions

Files

Original bundle

Collections

Publication:
Speaker recognition under stress conditions