CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
LREC, pp. 4197-4203, 2020.
EI
Abstract:
Spoken language translation has recently witnessed a resurgence in popularity, thanks to the development of end-to-end models and the creation of new corpora, such as Augmented LibriSpeech and MuST-C. Existing datasets involve language pairs with English as a source language, involve very specific domains or are low resource. We introdu...More
Code:
Data:
Full Text
Tags
Comments