Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis

Guanghui Xu
Guanghui Xu
Wei Song
Wei Song
Zhengchen Zhang
Zhengchen Zhang
Chao Zhang
Chao Zhang
Cited by: 0|Bibtex|Views12
Other Links: arxiv.org

Abstract:

Despite prosody is related to the linguistic information up to the discourse structure, most text-to-speech (TTS) systems only take into account that within each sentence, which makes it challenging when converting a paragraph of texts into natural and expressive speech. In this paper, we propose to use the text embeddings of the neighb...More

Code:

Data:

Your rating :
0

 

Tags
Comments