xmlui.dri2xhtml.METS-1.0.item-contributor-funder:
Comunidad de Madrid Ministerio de Economía y Competitividad (España) Universidad Carlos III de Madrid Ministerio de Educación, Cultura y Deporte (España)
Sponsor:
This research was funded by the Spanish Government through the CICYT projects
(TRA2016-78886-C3-1-R and RTI2018-096036-B-C21), Universidad Carlos III of Madrid through (PEAVAUTO-CM-UC3M), the Comunidad de Madrid through SEGVAUTO-4.0-CM (P2018/EMT-4362), and the Ministerio de Educación, Cultura y Deporte para la Formación de Profesorado Universitario (FPU14/02143).
Project:
Gobierno de España. TRA2016-78886-C3-1-R Gobierno de España. RTI2018-096036-B-C21 Gobierno de España. FPU14/02143 Comunidad de Madrid. P2018/EMT-4362-SEGVAUTO-4.0-CM Comunidad de Madrid. PEAVAUTO-CM-UC3M
Keywords:
Appearance affinity
,
Triplet model
,
Contrastive loss function
,
Deep convolutional
,
Neural network
,
Re-identification
,
Multi-object tracking
Recognizing the identity of a query individual in a surveillance sequence is the core of Multi-Object Tracking (MOT) and Re-Identification (Re-Id) algorithms. Both tasks can be addressed by measuring the appearance affinity between people observations with a dRecognizing the identity of a query individual in a surveillance sequence is the core of Multi-Object Tracking (MOT) and Re-Identification (Re-Id) algorithms. Both tasks can be addressed by measuring the appearance affinity between people observations with a deep neural model. Nevertheless, the differences in their specifications and, consequently, in the characteristics and constraints of the available training data for each one of these tasks, arise from the necessity of employing different learning approaches to attain each one of them. This article offers a comparative view of the Double-Margin-Contrastive and the Triplet loss function, and analyzes the benefits and drawbacks of applying each one of them to learn an Appearance Affinity model for Tracking and Re-Identification. A batch of experiments have been conducted, and their results support the hypothesis concluded from the presented study: Triplet loss function is more effective than the Contrastive one when an Re-Id model is learnt, and, conversely, in the MOT domain, the Contrastive loss can better discriminate between pairs of images rendering the same person or not.[+][-]