NMF-based temporal feature integration for acoustic event classification

Thumbnail Image
ISSN: 2308-457X
ISBN: 9781629934433
Publication date
Defense date
Journal Title
Journal ISSN
Volume Title
International Speech Communication Association
Google Scholar
Research Projects
Organizational Units
Journal Issue
In this paper, we propose a new front-end for Acoustic Event Classification tasks (AEC) based on the combination of the temporal feature integration technique called Filter Bank Coefficients (FC) and Non-Negative Matrix Factorization (NMF). FC aims to capture the dynamic structure in the short-term features by means of the summarization of the periodogram of each short-term feature dimension in several frequency bands using a predefined filter bank. As the commonly used filter bank has been devised for other tasks (such as music genre classification), it can be suboptimal for AEC. In order to overcome this drawback, we propose an unsupervised method based on NMF for learning the filters which collect the most relevant temporal information in the short-time features for AEC. The experiments show that the features obtained with this method achieve significant improvements in the classification performance of a Support Vector Machine (SVM) based AEC system in comparison with the baseline FC features.
Proceedings of: 14th Annual Conference of the International Speech Communication Association. Lyon, France, 25-29 August 2013.
Acoustic event classification, Temporal feature integration, Won-negative matrix factorization
Bibliographic citation
Bimbot, F. et al. (eds.) (2013). INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013. (pp. 2924-2928). International Speech Communication Association.