Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

García-Moral, Ana I.; Solera Ureña, R.; Peláez Moreno, Carmen; Díaz de María, Fernando

Publication:
Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

Identifiers

URI: http://hdl.handle.net/10016/13074

ISSN: 1558-7916

DOI: 10.1109/TASL.2010.2050513

Files

TASLP09_revised_doublecolumn.pdf (348.25 KB)

Publication date

2011-03

Authors

García-Moral, Ana I.

Solera Ureña, R.

Peláez Moreno, Carmen

Díaz de María, Fernando

Publisher

IEEE

Impact

Export

Abstract

Hybrid speech recognizers, where the estimation of the emission pdf of the states of Hidden Markov Models (HMMs), usually carried out using Gaussian Mixture Models (GMMs), is substituted by Artificial Neural Networks (ANNs) have several advantages over the classical systems. However, to obtain performance improvements, the computational requirements are heavily increased because of the need to train the ANN. Departing from the observation of the remarkable skewness of speech data, this paper proposes sifting out the training set and balancing the amount of samples per class. With this method the training time has been reduced 18 times while obtaining performances similar to or even better than those with the whole database, especially in noisy environments. However, the application of these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by the modification of the distribution of the training data, a proper scaling of the a posteriori probabilities obtained and a resizing of the context window need to be performed as demonstrated in the paper.

Keywords

Robust ASR, Additive noise, Machine learning, Hybrid ASR, Artificial Neural Networks, Multilayer Perceptrons, Hidden Markov Models, Active Learning, ANN/HMM, MLP/HMM

Bibliographic citation

IEEE Transactions on Audio, Speech, and Language Processing, 19(3), Mar. 2011, pp. 468–481

Collections

DTSC - GPM - Artículos de Revistas

Full item page

Publication:
Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication: Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication:
Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems