Publication:
A simulated annealing approach to speaker segmentation in audio databases

Research Projects
Organizational Units
Journal Issue
Abstract
In this paper we present a novel approach to the problem of speaker segmentation, which is an unavoidable previous step to audio indexing. Mutual information is used for evaluating the accuracy of the segmentation, as a function to be maximized by a simulated annealing (SA) algorithm. We introduce a novel mutation operator for the SA, the Consecutive Bits Mutation operator, which improves the performance of the SA in this problem. We also use the so-called Compaction Factor, which allows the SA to operate in a reduced search space. Our algorithm has been tested in the segmentation of real audio databases, and it has been compared to several existing algorithms for speaker segmentation, obtaining very good results in the test problems considered.
Description
Keywords
Speaker segmentation, Simulated annealing, Information theory, Audio indexing
Bibliographic citation
Leiva-Murillo, J. M., Salcedo-Sanz, S., Gallardo-Antolín, A. & Artés-Rodríguez, A. (2008). A simulated annealing approach to speaker segmentation in audio databases. Engineering Applications of Artificial Intelligence, 21(4), pp. 499–508.