Publication: A simulated annealing approach to speaker segmentation in audio databases
Identifiers
Publication date
2008-06
Defense date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
In this paper we present a novel approach to the problem of speaker segmentation, which is an unavoidable previous step to audio indexing. Mutual information is used for evaluating the accuracy of the segmentation, as a function to be maximized by a simulated annealing (SA) algorithm. We introduce a novel mutation operator for the SA, the Consecutive Bits Mutation operator, which improves the performance of the SA in this problem. We also use the so-called Compaction Factor, which allows the SA to operate in a reduced search space. Our algorithm has been tested in the segmentation of real audio databases, and it has been compared to several existing algorithms for speaker segmentation, obtaining very good results in the test problems considered.
Description
Keywords
Speaker segmentation, Simulated annealing, Information theory, Audio indexing
Bibliographic citation
Leiva-Murillo, J. M., Salcedo-Sanz, S., Gallardo-Antolín, A. & Artés-Rodríguez, A. (2008). A simulated annealing approach to speaker segmentation in audio databases. Engineering Applications of Artificial Intelligence, 21(4), pp. 499–508.