DSpace e-Archivo


Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/13074

Files in This Item:
TASLP09_revised_doublecolumn.pdf (348.25 kB, Adobe PDF)
Title: Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems
Author(s): García-Moral, Ana I.
Solera Ureña, R.
Peláez-Moreno, Carmen
Díaz-de-María, Fernando
Publisher: IEEE
Issued date: Mar-2011
Citation: IEEE Transactions on Audio, Speech, and Language Processing, 19(3), Mar. 2011, pp. 468–481
URI: http://hdl.handle.net/10016/13074
ISSN: 1558-7916
DOI: http://dx.doi.org/10.1109/TASL.2010.2050513
Abstract: Hybrid speech recognizers replace the Gaussian Mixture Models (GMMs) usually used to estimate the emission pdfs of Hidden Markov Model (HMM) states with Artificial Neural Networks (ANNs), and have several advantages over classical systems. However, these performance improvements come at a heavily increased computational cost, because the ANN must be trained. Starting from the observation that speech data are remarkably skewed, this paper proposes sifting the training set and balancing the number of samples per class. With this method, training time is reduced by a factor of 18 while performance remains similar to, or even better than, that obtained with the whole database, especially in noisy environments. However, applying these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by modifying the distribution of the training data, the a posteriori probabilities must be properly scaled and the context window resized, as demonstrated in the paper.
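Illustrative sketch (not taken from the paper): the abstract's two core ideas, balancing the number of training samples per class and rescaling the network's a posteriori probabilities, can be pictured roughly as follows. The function names, the `max_per_class` cap, the random subsampling, and the choice to divide posteriors by the priors of the balanced training set (a common convention in hybrid ANN/HMM systems) are all assumptions made for illustration; the authors' actual selection and scaling procedures may differ.

```python
import random
from collections import Counter, defaultdict


def balance_by_class(labels, max_per_class, seed=0):
    """Return sorted sample indices, keeping at most max_per_class per class.

    Hypothetical helper: random subsampling stands in for whatever
    selection criterion the paper actually uses.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    kept = []
    for idxs in by_class.values():
        if len(idxs) > max_per_class:
            idxs = rng.sample(idxs, max_per_class)
        kept.extend(idxs)
    return sorted(kept)


def class_priors(labels):
    """Relative frequency of each class in a list of labels."""
    counts = Counter(labels)
    total = len(labels)
    return {c: n / total for c, n in counts.items()}


def scaled_likelihoods(posteriors, priors):
    """Divide each posterior by the prior of the set the network was trained on.

    Assumption: this is the usual hybrid-system conversion from posteriors
    to scaled likelihoods, not necessarily the paper's scaling.
    """
    return {c: p / priors[c] for c, p in posteriors.items()}


# Toy, heavily skewed frame labels (silence dominates).
labels = ["sil"] * 6 + ["a"] * 3 + ["t"]
kept = balance_by_class(labels, max_per_class=2)
balanced = [labels[i] for i in kept]
priors = class_priors(balanced)
scores = scaled_likelihoods({"sil": 0.5, "a": 0.3, "t": 0.2}, priors)
```

In this toy case the balanced set keeps two "sil" frames, two "a" frames and the single "t" frame, so the priors used for scaling come from the reduced set and not from the original skewed data.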
Sponsor: This work was supported in part by the regional grant (Comunidad Autónoma de Madrid-UC3M) CCG06-UC3M/TIC-0812 and in part by a project funded by the Spanish Ministry of Science and Innovation (TEC 2008-06382).
Publisher version: http://dx.doi.org/10.1109/TASL.2010.2050513
Keywords: Robust ASR
Additive noise
Machine learning
Hybrid ASR
Artificial Neural Networks
Multilayer Perceptrons
Hidden Markov Models
Active Learning
ANN/HMM
MLP/HMM
Rights: © IEEE
Appears in Collections: DTSC - GPM - Artículos de Revistas


Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 
