Publication:
NMF-based temporal feature integration for acoustic event classification

dc.affiliation.dpto: UC3M. Departamento de Teoría de la Señal y Comunicaciones [es]
dc.affiliation.grupoinv: UC3M. Grupo de Investigación: Procesado Multimedia [es]
dc.contributor.author: Gallardo Antolín, Ascensión [es]
dc.contributor.author: Ludeña Choez, Jimmy D.
dc.date.accessioned: 2015-07-29T10:57:08Z
dc.date.available: 2015-07-29T10:57:08Z
dc.date.issued: 2013
dc.description: Proceedings of: 14th Annual Conference of the International Speech Communication Association. Lyon, France, 25-29 August 2013. [en]
dc.description.abstract: In this paper, we propose a new front-end for Acoustic Event Classification (AEC) tasks based on the combination of the temporal feature integration technique called Filter Bank Coefficients (FC) and Non-Negative Matrix Factorization (NMF). FC aims to capture the dynamic structure in the short-term features by summarizing the periodogram of each short-term feature dimension in several frequency bands using a predefined filter bank. As the commonly used filter bank was devised for other tasks (such as music genre classification), it can be suboptimal for AEC. To overcome this drawback, we propose an unsupervised method based on NMF for learning the filters that collect the most relevant temporal information in the short-term features for AEC. The experiments show that the features obtained with this method achieve significant improvements in the classification performance of a Support Vector Machine (SVM) based AEC system in comparison with the baseline FC features. [en]
dc.description.sponsorship: This work has been partially supported by the Spanish Government grants TSI-020110-2009-103, IPT-120000-2010-24 and TEC2011-26807 [en]
dc.description.status: Publicado [es]
dc.format.extent: 5
dc.format.mimetype: application/pdf
dc.identifier.bibliographicCitation: Bimbot, F. et al. (eds.) (2013). INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013. (pp. 2924-2928). International Speech Communication Association. [en]
dc.identifier.isbn: 9781629934433
dc.identifier.issn: 2308-457X
dc.identifier.publicationfirstpage: 2924
dc.identifier.publicationlastpage: 2928
dc.identifier.publicationtitle: INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013. [en]
dc.identifier.uri: https://hdl.handle.net/10016/21474
dc.identifier.uxxi: CC/0000022059
dc.language.iso: eng
dc.publisher: International Speech Communication Association [en]
dc.relation.eventdate: 25-29 August 2013 [en]
dc.relation.eventnumber: 14
dc.relation.eventplace: Lyon, France. [en]
dc.relation.eventtitle: 14th Annual Conference of the International Speech Communication Association. [en]
dc.relation.projectID: Gobierno de España. TEC2011-26807 [es]
dc.relation.publisherversion: http://www.isca-speech.org/archive/archive_papers/interspeech_2013/i13_2924.pdf
dc.rights: © 2013 ISCA
dc.rights.accessRights: open access [en]
dc.subject.eciencia: Telecomunicaciones [es]
dc.subject.other: Acoustic event classification [en]
dc.subject.other: Temporal feature integration [en]
dc.subject.other: Non-negative matrix factorization [en]
dc.title: NMF-based temporal feature integration for acoustic event classification [en]
dc.type: conference paper [*]
dc.type.hasVersion: VoR [*]
dspace.entity.type: Publication
Files
Original bundle
Name:
NMFbased_INTERSPEECH_2013.pdf
Size:
394.04 KB
Format:
Adobe Portable Document Format
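
The abstract describes summarizing the periodogram of each short-term feature dimension with a filter bank, and learning that filter bank with NMF instead of fixing it in advance. The following is a minimal sketch of one plausible reading of that idea, not the authors' implementation. All function names (`periodograms`, `nmf`, `apply_filters`), the segment length, the hop, the NMF rank, and the use of Lee-Seung multiplicative updates with a Frobenius objective are assumptions made for illustration; the input is synthetic noise standing in for real short-term features.

```python
import numpy as np

def periodograms(features, seg_len, hop):
    """Periodogram of every feature dimension over sliding segments.

    features: (T, D) array of short-term features (T frames, D dimensions).
    Returns an array of shape (n_segments, D, seg_len // 2 + 1).
    """
    T, _ = features.shape
    starts = range(0, T - seg_len + 1, hop)
    segs = np.stack([features[s:s + seg_len] for s in starts])  # (S, L, D)
    spec = np.fft.rfft(segs, axis=1)                            # (S, bins, D)
    power = (np.abs(spec) ** 2) / seg_len
    return power.transpose(0, 2, 1)                             # (S, D, bins)

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factor a non-negative matrix V (n x m) as W (n x rank) @ H (rank x m).

    Uses multiplicative updates for the Frobenius objective (an assumption;
    the abstract does not state which NMF cost function is used).
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def apply_filters(P, W):
    """Summarize periodograms P (S, D, bins) with filters W (bins, K).

    Returns one feature vector of length D * K per segment.
    """
    S, D, _ = P.shape
    return (P @ W).reshape(S, D * W.shape[1])

# Synthetic stand-in for short-term features: 400 frames, 12 dimensions.
rng = np.random.default_rng(0)
feats = rng.standard_normal((400, 12))

P = periodograms(feats, seg_len=50, hop=25)
V = P.reshape(-1, P.shape[-1]).T   # columns are individual periodograms
W, H = nmf(V, rank=4)              # columns of W act as learned filters
fc = apply_filters(P, W)           # segment-level features
print(P.shape, W.shape, fc.shape)
```

In this sketch the columns of `W` play the role that the predefined filter bank plays in baseline FC: each one weights the periodogram bins, and the weighted sums become the segment-level features that would be passed to a classifier such as an SVM.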