SVitchboard-II and FiSVer-I: Crafting high quality and low complexity conversational English speech corpora using submodular function optimization.

Computer Speech & Language (2017)

Abstract
• We introduced a set of conversational English speech corpora with high acoustic quality and limited vocabulary, derived from the Switchboard-I and Fisher datasets.
• We investigated several state-of-the-art submodular function optimization procedures, including SCSC/SCSK, DS, and SFM optimization.
• We surveyed different submodular function instantiations, in which both “acoustic quality” and vocabulary size are measured by various submodular functions.
• We provided baseline word recognition results on all of the resulting speech corpora for both Gaussian mixture model (GMM) and deep neural network (DNN)-based systems.
• We released all of the corpora definitions and Kaldi training recipes freely in the public domain.
Keywords
Submodular function optimization, Automatic speech recognition, Speech corpus