Please use this identifier to cite or link to this item:
http://hdl.handle.net/10016/9045
|
| Title: | The synergy between bounded-distance HMM and spectral subtraction for robust speech recognition |
| Author(s): | Vicente-Peña, Jesús; Díaz-de-María, Fernando; Kleijn, W. Bastiaan |
| Publisher: | Elsevier |
| Issued date: | Feb-2010 |
| Citation: | Jesús de-Vicente-Peña, Fernando Díaz-de-María and W. Bastiaan Kleijn, “The Synergy between Bounded-Distance HMM and Spectral Subtraction for Robust Speech Recognition”, Speech Communication, Vol. 52, No. 2, pp. 123-133, Feb. 2010. |
| URI: | http://hdl.handle.net/10016/9045 |
| ISSN: | 0167-6393 |
| DOI: | 10.1016/j.specom.2009.09.002 |
| Abstract: | Additive noise causes significant performance losses in automatic speech recognition (ASR) systems. In this paper, we show that one of the causes contributing to these losses is that conventional recognisers take into account feature values that are outliers. The method that we call bounded-distance HMM is well suited to preventing outliers from contributing to the recogniser decision. However, this method deals only with outliers, leaving the remaining features unaltered. In contrast, spectral subtraction is able to correct all the features, at the expense of introducing some artifacts that, as shown in the paper, cause a larger number of outliers. As a result, we find that bounded-distance HMM and spectral subtraction complement each other well. A comprehensive experimental evaluation was conducted, considering several well-known ASR tasks (of different complexities) and numerous noise types and SNRs. The results show that the suggested combination generally outperforms both bounded-distance HMM and spectral subtraction applied individually. Furthermore, the improvements obtained, especially for low and medium SNRs, are larger than the sum of the improvements obtained individually by bounded-distance HMM and spectral subtraction. |
| Review: | PeerReviewed |
| Publisher version: | http://dx.doi.org/10.1016/j.specom.2009.09.002 |
| Keywords: | Robust speech recognition; Spectral subtraction; Acoustic backing-off; Bounded-distance HMM; Missing features; Outliers |
| Appears in Collections: | DTSC - GPM - Artículos de Revistas |
Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.