Tundra: A Multilingual Corpus Of Found Data For Tts Research Created With Light Supervision

A. Stan,O. Watts, Y. Mamiya,M. Giurgiu, R. A. J. Clarke,J. Yamagishi,S. King

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5（2013）

引用 49|浏览28

暂无评分

摘要

Simple4All Tundra (version 1.0) is the first release of a standardised multilingual corpus designed for text-to-speech research with imperfect or found data. The corpus consists of approximately 60 hours of speech data from audiobooks in 14 languages, as well as utterance-level alignments obtained with a lightly-supervised process. Future versions of the corpus will include finer-grained alignment and prosodic annotation, all of which will be made freely available. This paper gives a general outline of the data collected so far, as well as a detailed description of how this has been done, emphasizing the minimal language-specific knowledge and manual intervention used to compile the corpus. To demonstrate its potential use, text to-speech systems have been built for all languages using unsupervised or lightly supervised methods, also briefly presented in the paper.

查看译文

关键词

multilingual corpus,light supervision,imperfect data,found data,text-to-speech,audiobook data

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要