CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus

Wang Changhan
Wang Changhan
Pino Juan
Pino Juan
Wu Anne
Wu Anne

LREC, pp. 4197-4203, 2020.

Cited by: 0|Bibtex|Views43
EI
Other Links: arxiv.org|dblp.uni-trier.de|academic.microsoft.com

Abstract:

Spoken language translation has recently witnessed a resurgence in popularity, thanks to the development of end-to-end models and the creation of new corpora, such as Augmented LibriSpeech and MuST-C. Existing datasets involve language pairs with English as a source language, involve very specific domains or are low resource. We introdu...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments