Citation:
Gallardo-Antolin, A. & Montero, J. M. (2010). Histogram Equalization-Based Features for Speech, Music, and Song Discrimination. IEEE Signal Processing Letters, 17(7), pp. 659–662.
xmlui.dri2xhtml.METS-1.0.item-contributor-funder:
Ministerio de Economía y Competitividad (España) Comunidad de Madrid Universidad Carlos III de Madrid Ministerio de Educación, Cultura y Deporte (España)
Sponsor:
This work was supported in part by the regional project Comunidad de Madrid-UC3M CCG08-UC3M/TIC-4457 and the Spanish Government Projects TEC2008-06382 and DPI2007-66846-C02-02.
Project:
Comunidad de Madrid. CCG08-UC3M/TIC-4457 Gobierno de España. TEC2008-06382 Gobierno de España. DPI2007-66846-C02-02
Keywords:
Speech/Music/Song discrimination
,
Audio classification
,
HEQ-based features
,
Acoustic features
,
Parameterization
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distribuIn this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.[+][-]