ASR Feature Extraction with Morphologically-Filtered Power-Normalized Cochleograms

Calle Silos, Fernando de la; Valverde Albacete, Francisco José; Gallardo Antolín, Ascensión; Peláez Moreno, Carmen


dc.affiliation.dpto	UC3M. Departamento de Teoría de la Señal y Comunicaciones	es
dc.affiliation.grupoinv	UC3M. Grupo de Investigación: Procesado Multimedia	es
dc.contributor.author	Calle Silos, Fernando de la	es
dc.contributor.author	Valverde Albacete, Francisco José	es
dc.contributor.author	Gallardo Antolín, Ascensión	es
dc.contributor.author	Peláez Moreno, Carmen	es
dc.date.accessioned	2015-07-30T11:19:46Z
dc.date.available	2015-07-30T11:19:46Z
dc.date.issued	2014
dc.description	Proceedings of: 15th Annual Conference of the International Speech Communication Association. Singapore, September 14-18, 2014.	en
dc.description.abstract	In this paper we present advances in the modeling of the masking behavior of the Human Auditory System to enhance the robustness of the feature extraction stage in Automatic Speech Recognition. The solution adopted is based on a non-linear filtering of a spectro-temporal representation, applied simultaneously in both the frequency and time domains, by processing it using mathematical morphology operations as if it were an image. A particularly important component of this architecture is the so-called structuring element: biologically-based considerations are addressed in the present contribution to design an element that closely resembles the masking phenomena taking place in the cochlea. The second feature of this contribution is the choice of underlying spectro-temporal representation. The best results were achieved by the representation introduced as part of the Power Normalized Cepstral Coefficients together with a spectral subtraction step. On the Aurora 2 noisy continuous digits task, we report relative error reductions of 18.7% compared to PNCC and 39.5% compared to MFCC.	en
dc.description.sponsorship	This contribution has been supported by an Airbus Defense and Space Grant (Open Innovation - SAVIER) and Spanish Government-CICYT project 2011-26807/TEC.	en
dc.description.status	Published	en
dc.format.extent	5
dc.format.mimetype	application/pdf
dc.identifier.bibliographicCitation	Li, Haizhou, et al. (eds). (2014). INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014. (pp. 2430-2434). International Speech Communication Association.	en
dc.identifier.isbn	9781634394352
dc.identifier.publicationfirstpage	2430
dc.identifier.publicationlastpage	2434
dc.identifier.publicationtitle	INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014.	en
dc.identifier.uri	https://hdl.handle.net/10016/21480
dc.identifier.uxxi	CC/0000022423
dc.language.iso	eng	en
dc.publisher	International Speech Communication Association	en
dc.relation.eventdate	September 14-18, 2014.	en
dc.relation.eventnumber	15
dc.relation.eventplace	Singapore	en
dc.relation.eventtitle	Annual Conference of the International Speech Communication Association (INTERSPEECH 2014)	en
dc.relation.projectID	Gobierno de España. TEC2011-26807	es
dc.relation.publisherversion	http://www.isca-speech.org/archive/archive_papers/interspeech_2014/i14_2430.pdf	en
dc.rights	© 2014 ISCA	es
dc.rights.accessRights	open access	es
dc.subject.eciencia	Telecommunications	en
dc.subject.other	Spectro-temporal processing	en
dc.subject.other	Morphological filtering	en
dc.subject.other	Automatic speech recognition	en
dc.subject.other	Auditory-based features	en
dc.subject.other	PNCC	en
dc.title	ASR Feature Extraction with Morphologically-Filtered Power-Normalized Cochleograms	en
dc.type	conference poster	*
dc.type.hasVersion	VoR	*
dspace.entity.type	Publication
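
The abstract describes filtering a spectro-temporal representation with mathematical morphology operations, treating it as an image. The record does not say which operations or which structuring element the authors use, so the following is only a minimal sketch under stated assumptions: it applies a grayscale opening (erosion followed by dilation) with a flat 3x3 rectangular structuring element to a toy time-frequency grid. The function names, the rectangular element, the border handling, and the toy data are all hypothetical illustrations, not the paper's masking-shaped element or its cochleogram.

```python
def _neighbourhood(img, r, c, half_h, half_w):
    """Values of img inside the window centred at (r, c), clipped to the grid."""
    rows, cols = len(img), len(img[0])
    return [img[i][j]
            for i in range(max(0, r - half_h), min(rows, r + half_h + 1))
            for j in range(max(0, c - half_w), min(cols, c + half_w + 1))]


def erode(img, se_shape=(3, 3)):
    """Grayscale erosion with a flat rectangular structuring element (assumed shape)."""
    hh, hw = se_shape[0] // 2, se_shape[1] // 2
    return [[min(_neighbourhood(img, r, c, hh, hw)) for c in range(len(img[0]))]
            for r in range(len(img))]


def dilate(img, se_shape=(3, 3)):
    """Grayscale dilation with the same flat rectangular structuring element."""
    hh, hw = se_shape[0] // 2, se_shape[1] // 2
    return [[max(_neighbourhood(img, r, c, hh, hw)) for c in range(len(img[0]))]
            for r in range(len(img))]


def opening(img, se_shape=(3, 3)):
    """Morphological opening: erosion followed by dilation."""
    return dilate(erode(img, se_shape), se_shape)


# Toy "spectrogram": rows are frequency channels, columns are time frames.
spec = [[0.0] * 8 for _ in range(6)]
for r in range(1, 4):
    for c in range(1, 4):
        spec[r][c] = 4.0   # a broad 3x3 energy region, as large as the element
spec[4][6] = 9.0           # an isolated one-cell peak, smaller than the element

opened = opening(spec)
```

In this toy case the opening keeps the broad energy region and removes the isolated peak, because only structures at least as large as the structuring element survive the erosion step. A non-flat, asymmetric element shaped after auditory masking, as the abstract suggests, would need a different implementation.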

Files

Original bundle

Name: Feature_INTERSPEECH_2014.pdf
Size: 342.09 KB
Format: Adobe Portable Document Format

Collections

DTSC - GPM - Comunicaciones en congresos y otros eventos
DTSC - GPM - Capítulos de Monografías
