Baybars Külebi, Alp Öktem, Alex Peiró-Lilja, Santiago Pascual, Mireia Farrús
CATOTRON – A Neural Text-to-Speech System in Catalan.
In: Interspeech 2020; 2020 Oct 25-29; Shanghai, China. (Online) 

Abstract

We present Catotron, a neural network-based open-source speech synthesis system in Catalan. Catotron consists of a sequence-to-sequence model trained with two small open-source datasets based on semi-spontaneous and read speech. We demonstrate how a neural TTS can be built for languages with limited resources using found-data optimization and cross-lingual transfer learning. We make the datasets, initial models and source code publicly available for both commercial and research purposes.

Access

Media mentions

Catotron, the first free, open speech synthesis system, based on neural networks by Recerca UPF
Catotron es renova per impulsar la síntesi de veu en català by Metadata.cat
Siri o Alexa en català? Una nova eina ho facilita by Diari més
La cooperativa Col·lectivaT crea el primer motor de síntesi de veu en català by La República
El primer motor de síntesi de veu en català ja és una realitat by Nació digital
La cooperativa Col·lectivaT crea el primer motor de síntesi de veu en català by El Punt Avui

Abstract

Access

Related material

Media mentions

Presentation video