Automatic extraction of parallel speech corpora from dubbed movies
Öktem A, Farrús M, Wanner L.
Automatic Extraction of Parallel Speech Corpora from Dubbed Movies
Proceedings of the 10th Workshop on Building and Using Comparable Corpora (BUCC); 2017 30 July - 4 Aug; Vancouver, Canada.
ACL, 2017. p. 31-35.
Abstract
This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. The obtained parallel corpora are especially suitable for speech-to-speech translation applications when a prosody transfer between source and target languages is desired.
Access
The paper can be accessed through the following repositories: