A Speech Test Set of Practice Business Presentations with Additional Relevant Texts

CoRR (2019)

Abstract
We present a test corpus of audio recordings and transcriptions of presentations of students’ enterprises, together with their slides and web pages. The corpus is intended for the evaluation of automatic speech recognition (ASR) systems, especially in conditions where prior availability of in-domain vocabulary and named entities is beneficial. The corpus consists of 39 presentations in English, each up to 90 seconds long. The speakers are high school students from European countries who speak English as a second language. We benchmark three baseline ASR systems on the corpus and show that their performance is still imperfect.
Keywords
Speech recognition, ASR evaluation, Speech corpus, Non-native English