Robust ASR using Support Vector Machines

Solera Ureña, R.; Martín Iglesias, D.; Gallardo Antolín, Ascensión; Peláez Moreno, Carmen; Díaz de María, Fernando

Publication:
Robust ASR using Support Vector Machines

Identifiers

URI: http://hdl.handle.net/10016/2322

ISSN: 0167-6393

DOI: 10.1016/j.specom.2007.01.013

Files

solera07.pdf (276.27 KB)

Publication date

2007

Authors

Solera Ureña, R.

Martín Iglesias, D.

Gallardo Antolín, Ascensión

Peláez Moreno, Carmen

Díaz de María, Fernando

Publisher

European Association for Signal Processing (EURASIP) : International Speech Communication Association (ISCA)

Impact

Export

Abstract

The improved theoretical properties of Support Vector Machines with respect to other machine learning alternatives due to their max-margin training paradigm have led us to suggest them as a good technique for robust speech recognition. However, important shortcomings have had to be circumvented, the most important being the normalisation of the time duration of different realisations of the acoustic speech units. In this paper, we have compared two approaches in noisy environments: first, a hybrid HMM–SVM solution where a fixed number of frames is selected by means of an HMM segmentation and second, a normalisation kernel called Dynamic Time Alignment Kernel (DTAK) first introduced in Shimodaira et al. [Shimodaira, H., Noma, K., Nakai, M., Sagayama, S., 2001. Support vector machine with dynamic time-alignment kernel for speech recognition. In: Proc. Eurospeech, Aalborg, Denmark, pp. 1841–1844] and based on DTW (Dynamic Time Warping). Special attention has been paid to the adaptation of both alternatives to noisy environments, comparing two types of parameterisations and performing suitable feature normalisation operations. The results show that the DTA Kernel provides important advantages over the baseline HMM system in medium to bad noise conditions, also outperforming the results of the hybrid system.

Keywords

Robust ASR, Additive noise, Machine learning, Support vector machines, Kernel methods, HMM, ANN, Hybrid ASR, Dynamic Time Alignment

Bibliographic citation

Speech Communication. Vol. 49, No. 4, Abril 2007, pp. 253-267

Collections

DTSC - GPM - Artículos de Revistas

Full item page

Publication:
Robust ASR using Support Vector Machines

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication: Robust ASR using Support Vector Machines

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication:
Robust ASR using Support Vector Machines