Publication: Speaker recognition under stress conditions
dc.affiliation.dpto | UC3M. Departamento de Teoría de la Señal y Comunicaciones | es |
dc.affiliation.grupoinv | UC3M. Grupo de Investigación: Procesado Multimedia | es |
dc.contributor.author | Rituerto González, Esther | |
dc.contributor.author | Gallardo Antolín, Ascensión | |
dc.contributor.author | Peláez Moreno, Carmen | |
dc.contributor.funder | Ministerio de Economía y Competitividad (España) | es |
dc.date.accessioned | 2019-10-16T10:29:23Z | |
dc.date.available | 2019-10-16T10:29:23Z | |
dc.date.issued | 2018-11 | |
dc.description | Proceeding of: IberSPEECH 2018, 21-23 November 2018, Barcelona, Spain | en |
dc.description.abstract | Speaker recognition systems exhibit a decrease in performance when the input speech is not produced under optimal circumstances, for example when the user is under emotional or stress conditions. The objective of this paper is to measure the effects of stress on speech in order to ultimately mitigate its consequences on a speaker recognition task. In this paper, we develop a stress-robust speaker identification system using data selection and augmentation by means of the manipulation of the original speech utterances. Extensive experimentation has been carried out to assess the effectiveness of the proposed techniques. First, we conclude that the best performance is always obtained when naturally stressed samples are included in the training set; second, when these are not available, their substitution and augmentation with synthetically generated stress-like samples improves the performance of the system. | en |
dc.description.sponsorship | This work is partially supported by the Spanish Government-MinECo projects TEC2014-53390-P and TEC2017-84395-P. | en |
dc.format.extent | 5 | es |
dc.identifier.bibliographicCitation | Proceedings of IberSPEECH 2018, pp. 15-19 | en |
dc.identifier.doi | https://doi.org/10.21437/IberSPEECH.2018-4 | |
dc.identifier.publicationfirstpage | 15 | es |
dc.identifier.publicationlastpage | 19 | es |
dc.identifier.publicationtitle | Proceedings of IberSPEECH 2018, 21-23 November 2018, Barcelona, Spain | es |
dc.identifier.uri | https://hdl.handle.net/10016/29035 | |
dc.identifier.uxxi | CC/0000028620 | |
dc.language.iso | eng | es |
dc.relation.eventdate | 2018-11-21 | es |
dc.relation.eventplace | BARCELONA | es |
dc.relation.eventtitle | IberSPEECH 2018 | es |
dc.relation.projectID | Gobierno de España. TEC2014-53390-P | es |
dc.relation.projectID | Gobierno de España. TEC2017-84395-P | es |
dc.rights.accessRights | open access | es |
dc.subject.eciencia | Electrónica | es |
dc.subject.eciencia | Telecomunicaciones | es |
dc.subject.other | Speaker recognition | en |
dc.subject.other | Speaker identification | en |
dc.subject.other | Emotions | en |
dc.subject.other | Stress conditions | en |
dc.subject.other | Data augmentation | en |
dc.subject.other | Synthetic stress | en |
dc.title | Speaker recognition under stress conditions | en |
dc.type | conference paper | * |
dc.type.hasVersion | VoR | * |
dspace.entity.type | Publication |
Files
Original bundle
- Name: speaker_IBERSPEECH_2018.pdf
- Size: 130.55 KB
- Format: Adobe Portable Document Format