Deep-Sync: A novel deep learning-based tool for semantic-aware subtitling synchronisation

Subtitles are a key element in making media content accessible to people with hearing impairments and to elderly people, and they are also useful when watching TV in a noisy environment or learning a new language. Most of the time, subtitles are generated manually in advance, producing a verbatim, synchronised transcription of the audio. In live TV broadcasts, however, captions are created in real time by a re-speaker with the help of voice recognition software, which inevitably leads to delays and a lack of synchronisation. In this paper, we present Deep-Sync, a tool for aligning subtitles with audio-visual content. The architecture integrates a deep language representation model and real-time voice recognition software to build a semantic-aware alignment tool that successfully aligns most subtitles even when there is no direct correspondence between the re-speaker's output and the audio content. To avoid any kind of censorship, Deep-Sync can be deployed directly on users' TVs, introducing a small delay to perform the alignment while avoiding any delay of the signal at the broadcaster station. Deep-Sync was compared with another subtitle alignment tool, showing that our proposal improves synchronisation in all tested cases.
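The core idea of semantic-aware alignment is to match each subtitle against the segment of the live transcript it most closely corresponds to, even when the wording differs. The following minimal sketch illustrates this matching step with a toy bag-of-words similarity; the actual Deep-Sync system uses a deep language representation model on top of real-time voice recognition output, and the segment texts and function names here are purely illustrative.

```python
from collections import Counter
import math

def vectorise(text):
    # Toy bag-of-words vector. Deep-Sync instead embeds text with a
    # deep language model, which captures semantic (not just lexical)
    # similarity between the re-speaker's caption and the audio transcript.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def align_subtitle(subtitle, asr_segments):
    # Return the index (and score) of the timestamped ASR segment most
    # similar to the subtitle; that segment's timestamp would then be
    # used to re-synchronise the caption.
    sub_vec = vectorise(subtitle)
    scores = [cosine(sub_vec, vectorise(seg)) for seg in asr_segments]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]

# Hypothetical timestamped segments from a live speech recogniser:
asr = ["good evening and welcome",
       "the weather today is sunny",
       "now over to sports"]
idx, score = align_subtitle("Today's weather will be sunny", asr)
# The subtitle is matched to the weather segment (index 1) despite
# the paraphrased wording.
```

In the real system, replacing the bag-of-words vectoriser with contextual embeddings from a deep language model is what allows alignment to succeed when the re-speaker paraphrases rather than repeats the audio verbatim.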
Keywords: TV broadcasting, synchronisation, language model, deep neural networks, machine learning
Bibliographic citation
Martín, A., González-Carrasco, I., Rodriguez-Fernandez, V. et al. Deep-Sync: A novel deep learning-based tool for semantic-aware subtitling synchronisation. Neural Comput & Applic (2021).