Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning
INTERSPEECH, pp. 2080-2084, 2019.
We present a multispeaker, multilingual text-to-speech (TTS) synthesis model based on Tacotron that is able to produce high-quality speech in multiple languages. Moreover, the model is able to transfer voices across languages, e.g., synthesize fluent Spanish speech using an English speaker's voice, without training on any bilingual or pa...