Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
arXiv (Cornell University) (2020)
Keywords
few-shot, text-to-speech (TTS), neural TTS, multi-speaker modeling, speaker embedding