Sample Efficient Adaptive Text-to-Speech
international conference on learning representations, 2019.
We present a meta-learning approach for adaptive text-to-speech (TTS) with few data. During training, we learn a multi-speaker model using a shared conditional WaveNet core and independent learned embeddings for each speaker. The aim of training is not to produce a neural network with fixed weights, which is then deployed as a TTS system....More
PPT (Upload PPT)