CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
LREC, pp. 4197-4203, 2020.
Spoken language translation has recently witnessed a resurgence in popularity, thanks to the development of end-to-end models and the creation of new corpora, such as Augmented LibriSpeech and MuST-C. Existing datasets involve language pairs with English as a source language, involve very specific domains or are low resource. We introdu...More
PPT (Upload PPT)