Publication:
Histogram Equalization-Based Features for Speech, Music and Song Discrimination

Loading...
Thumbnail Image
Identifiers
Publication date
2010-07
Defense date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Impact
Google Scholar
Export
Research Projects
Organizational Units
Journal Issue
Abstract
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
Description
Keywords
Speech/Music/Song discrimination, Audio classification, HEQ-based features, Acoustic features, Parameterization
Bibliographic citation
Gallardo-Antolin, A. & Montero, J. M. (2010). Histogram Equalization-Based Features for Speech, Music, and Song Discrimination. IEEE Signal Processing Letters, 17(7), pp. 659–662.