Español English Contacte con nosotros http://www.uc3m.es/portal/page/portal/biblioteca
DSpace e-Archivo

Archivo Abierto Institucional de la Universidad Carlos III de Madrid > Investigación > Departamentos > Departamento de Teoría de la Señal y Comunicaciones > Grupo de Procesado Multimedia > DTSC - GPM - Capítulos de Monografías >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/2360

Files in This Item:
solera07.pdfPostprint285,62 kBAdobe PDFformato pdf
Title: SVMs for Automatic Speech Recognition: a Survey
Author(s): Solera Ureña, R.
Padrell Sendra, J.
Martín Iglesias, D.
Gallardo-Antolín, Asunción
Peláez-Moreno, Carmen
Días de María, F.
Publisher: Springer
Issued date: 2007
Citation: Progress in Nonlinear Speech Processing. Springer, 2007. ISBN 978-3-540-71503-0. PP. 190-216
URI: http://hdl.handle.net/10016/2360
ISBN: 978-3-540-71503-0
ISSN: 0302-9743 (Print)
1611-3349 (Online)
DOI: 10.1007/978-3-540-71505-4_11
Abstract: Hidden Markov Models (HMMs) are, undoubtedly, the most employed core technique for Automatic Speech Recognition (ASR). Nevertheless, we are still far from achieving high-performance ASR systems. Some alternative approaches, most of them based on Artificial Neural Networks (ANNs), were proposed during the late eighties and early nineties. Some of them tackled the ASR problem using predictive ANNs, while others proposed hybrid HMM/ANN systems. However, despite some achievements, nowadays, the preponderance of Markov Models is a fact. During the last decade, however, a new tool appeared in the field of machine learning that has proved to be able to cope with hard classification problems in several fields of application: the Support Vector Machines (SVMs). The SVMs are effective discriminative classifiers with several outstanding characteristics, namely: their solution is that with maximum margin; they are capable to deal with samples of a very higher dimensionality; and their convergence to the minimum of the associated cost function is guaranteed. These characteristics have made SVMs very popular and successful. In this chapter we discuss their strengths and weakness in the ASR context and make a review of the current state-of-the-art techniques. We organize the contributions in two parts: isolated-word recognition and continuous speech recognition. Within the first part we review several techniques to produce the fixed-dimension vectors needed for original SVMs. Afterwards we explore more sophisticated techniques based on the use of kernels capable to deal with sequences of different length. Among them is the DTAK kernel, simple and effective, which rescues an old technique of speech recognition: Dynamic Time Warping (DTW). Within the second part, we describe some recent approaches to tackle more complex tasks like connected digit recognition or continuous speech recognition using SVMs. Finally we draw some conclusions and outline several ongoing lines of research.
Review: PeerReviewed
Serie / Nº.: Lecture Notes on Computer Science
Volume 4391/2007
Publisher version: http://www.springerlink.com/content/r828226517290181/fulltext.pdf
Appears in Collections:DTSC - GPM - Capítulos de Monografías

Refworks Export

SFX Query

Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! © Universidad Carlos III de Madrid - Software DSpace - Terms of use - Feedback