Auditory-inspired morphological processing of speech spectrograms: applications in automatic speech recognition and speech enhancement

Cadore, Joyner; Valverde Albacete, Francisco José; Gallardo Antolín, Ascensión; Peláez Moreno, Carmen


Identifiers

URI: https://hdl.handle.net/10016/15932

ISSN: 1866-9956 (Print)

ISSN: 1866-9964 (Online)

DOI: 10.1007/s12559-012-9196-6

Files

cc_2012.pdf (1.21 MB)

Publication date

2012-11

Authors

Cadore, Joyner

Valverde Albacete, Francisco José

Gallardo Antolín, Ascensión

Peláez Moreno, Carmen

Publisher

Springer

Abstract

This paper presents new auditory-inspired speech processing methods that combine spectral subtraction with two-dimensional non-linear filtering techniques originally conceived for image processing. In particular, mathematical morphology operations such as erosion and dilation are applied to noisy speech spectrograms using specifically designed structuring elements inspired by the masking properties of the human auditory system. This is effectively complemented by a pre-processing stage that includes the conventional spectral subtraction procedure and auditory filterbanks. The methods were tested on both speech enhancement and automatic speech recognition tasks. For speech enhancement, time-frequency anisotropic structuring elements over grey-scale spectrograms were found to provide better perceptual quality than isotropic ones: under a number of perceptual quality estimation measures and several signal-to-noise ratios on the Aurora database, they proved more appropriate for retaining the structure of speech while removing background noise. For automatic speech recognition, the combination of spectral subtraction and auditory-inspired morphological filtering was found to improve recognition rates on a noise-contaminated version of the Isolet database.
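
The processing chain summarised in the abstract (spectral subtraction followed by grey-scale morphological filtering of the spectrogram) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the noise estimate taken from the first frames, the spectral floor, and the flat 1 x 5 structuring element stretched along the time axis are all assumptions made for illustration, and the masking-based structuring elements of the paper are not reproduced here.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def spectral_subtraction(spec, noise_frames=5, floor=0.01):
    """Subtract a per-bin noise estimate from a magnitude spectrogram.

    spec has shape (frequency bins, time frames). The noise estimate is the
    mean of the first `noise_frames` frames (an assumption of this sketch).
    """
    spec = np.asarray(spec, dtype=float)
    noise = spec[:, :noise_frames].mean(axis=1, keepdims=True)
    # Keep a small fraction of the original magnitude as a spectral floor.
    return np.maximum(spec - noise, floor * spec)


def _flat_filter(spec, footprint, reducer, pad_value):
    """Apply min or max over a flat footprint with odd side lengths."""
    fh, fw = footprint.shape
    padded = np.pad(spec, ((fh // 2, fh // 2), (fw // 2, fw // 2)),
                    constant_values=pad_value)
    windows = sliding_window_view(padded, (fh, fw))
    # Cells outside the footprint are replaced by a neutral value.
    return reducer(np.where(footprint, windows, pad_value), axis=(-2, -1))


def erosion(spec, footprint):
    """Grey-scale erosion: local minimum over the footprint."""
    return _flat_filter(spec, footprint, np.min, np.inf)


def dilation(spec, footprint):
    """Grey-scale dilation: local maximum over the footprint."""
    return _flat_filter(spec, footprint, np.max, -np.inf)


def opening(spec, footprint):
    """Erosion followed by dilation (symmetric footprint assumed)."""
    return dilation(erosion(spec, footprint), footprint)


if __name__ == "__main__":
    # Toy spectrogram: flat noise, one sustained ridge, one isolated spike.
    spec = np.full((8, 20), 0.2)
    spec[2, 3:16] += 1.0   # speech-like structure extended in time
    spec[4, 10] += 1.0     # isolated noise peak

    # Hypothetical anisotropic element: 1 frequency bin by 5 time frames.
    anisotropic = np.ones((1, 5), dtype=bool)

    cleaned = opening(spectral_subtraction(spec, noise_frames=2), anisotropic)
    print(np.round(cleaned, 2))
```

In this toy case the opening keeps the ridge, which is longer than the structuring element along time, and suppresses the isolated peak, which is shorter. An isotropic element such as a 3 x 3 square would also remove the one-bin-thick ridge, which gives some intuition for why the shape of the structuring element matters.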

Keywords

Spectral subtraction, Spectrogram, Morphological processing, Image filtering, Automatic speech recognition, Speech enhancement, Auditory-based features

Bibliographic citation

Cognitive Computation, December 2013, 5(4), pp. 426-441.

Collections

DTSC - GPM - Artículos de Revistas
