DNN Based Expressive Text-to-Speech with Limited Training Data

Sinisa Suzie,Tijana Nosek,Milan Secujski,Darko Pekar,Vlado Delie

2019 27th Telecommunications Forum (TELFOR)（2019）

引用 1|浏览12

暂无评分

摘要

Modern text-to-speech synthesis systems should deliver speech which is not just intelligible, but whose style corresponds to the domain in which synthesized speech is used. In this paper three approaches based on deep neural networks aimed at synthesis of expressive speech are presented: style code, model re-training and an architecture using shared hidden layers. Their usability is tested on a speech corpus with a limited amount of expressive speech data. A new architecture for transplanting speech styles is also presented and compared with a referent approach from literature.

查看译文

关键词

deep neural networks,expressive speech,style transplantation,text-to-speech synthesis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要