The CPQD-Unicamp system for Blizzard Challenge 2021

The Blizzard Challenge 2021(2021)

引用 0|浏览7
暂无评分
摘要
This paper presents the CPQD-UNICAMP text-to-speech system for Blizzard Challenge 2021. The system consists of a bilingual linguistic front-end, an acoustic model based on Tacotron2 and a Parallel Wavegan neural vocoder. A multispeaker Brazilian Portuguese dataset was added to the Blizzard 2021 dataset in order to train a bilingual acoustic model. The system was later fine-tuned with the target speaker data. Sentences were classified according to the punctuation type and a specialized model was trained for each category to better model the intonation pattern of non-declarative sentences. The Blizzard Challenge evaluation for the hub task shows that the proposed strategy achieved high naturalness, intelligibility and similarity results.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要