Español English Contacte con nosotros http://www.uc3m.es/portal/page/portal/biblioteca
DSpace e-Archivo

Archivo Abierto Institucional de la Universidad Carlos III de Madrid > Investigación > Departamentos > Departamento de Teoría de la Señal y Comunicaciones > Grupo de Procesado Multimedia > DTSC - GPM - Artículos de Revistas >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/2322

Files in This Item:
solera07.pdfversión preprint276,27 kBAdobe PDFformato pdf
Title: Robust ASR using Support Vector Machines
Author(s): Solera Ureña, R.
Martín Iglesias, D.
Gallardo-Antolín, Asunción
Peláez-Moreno, Carmen
Díaz-de-María, Fernando
Publisher: European Association for Signal Processing (EURASIP) : International Speech Communication Association (ISCA)
Issued date: 2007
Citation: Speech Communication. Vol. 49, No. 4, Abril 2007, pp. 253-267
URI: http://hdl.handle.net/10016/2322
ISSN: 0167-6393
DOI: 10.1016/j.specom.2007.01.013
Abstract: The improved theoretical properties of Support Vector Machines with respect to other machine learning alternatives due to their max-margin training paradigm have led us to suggest them as a good technique for robust speech recognition. However, important shortcomings have had to be circumvented, the most important being the normalisation of the time duration of different realisations of the acoustic speech units. In this paper, we have compared two approaches in noisy environments: first, a hybrid HMM–SVM solution where a fixed number of frames is selected by means of an HMM segmentation and second, a normalisation kernel called Dynamic Time Alignment Kernel (DTAK) first introduced in Shimodaira et al. [Shimodaira, H., Noma, K., Nakai, M., Sagayama, S., 2001. Support vector machine with dynamic time-alignment kernel for speech recognition. In: Proc. Eurospeech, Aalborg, Denmark, pp. 1841–1844] and based on DTW (Dynamic Time Warping). Special attention has been paid to the adaptation of both alternatives to noisy environments, comparing two types of parameterisations and performing suitable feature normalisation operations. The results show that the DTA Kernel provides important advantages over the baseline HMM system in medium to bad noise conditions, also outperforming the results of the hybrid system.
Review: PeerReviewed
Publisher version: http://www.sciencedirect.com/science?_ob= ArticleURL&_udi=B6V1C-4N206V6-2&_user= 143961&_coverDate=04%2F30%2F2007&_alid= 648032093&_rdoc=4&_fmt=full&_orig= search&_cdi=5671&_sort=d&_docanchor=&view= c&_ct=8&_acct=C000011938&_version=1&_urlVersion= 0&_userid=143961&md5=1f00932579916f207ad2550342396fb9
Keywords: Robust ASR
Additive noise
Machine learning
Support Vector Machines
Kernel methods
HMM
ANN
Hybrid ASR
Dynamic Time Alignment
Appears in Collections:DTSC - GPM - Artículos de Revistas

Refworks Export

SFX Query

This item is licensed under a Creative Commons License
Creative Commons

Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! © Universidad Carlos III de Madrid - Software DSpace - Terms of use - Feedback