Development of highly accurate real-time large scale speech recognition system

I Kim,C Park,Keymin Lee, N Kim,J Lee,J Kim,Ian Lane

ICCE（2015）

引用 2|浏览24

暂无评分

摘要

This paper describes the development of the framework and the algorithm for large scale automatic speech recognition systems. Technical advances include the acceleration of decoding speed by leveraging the computational power of many-core graphic processing units (GPU), in order to solve the issue of training data sparseness, improvement in the accuracy by Subspace Gaussian Mixture Models (SGMM), and employing novel methods of language models such as the Instant Language Model Adaptation (ILMA) method. We present the effectiveness of each technique by evaluating it with actual usage data collected from television sets. It is shown that the proposed engine can recognize speech at real time with high accuracy.

查看译文

关键词

gaussian processes,graphics processing units,mixture models,real-time systems,speech coding,speech recognition,gpu,ilma method,sgmm,computational power,decoding speed,highly accurate real-time large scale speech recognition system,instant language model adaptation,many-core graphic processing units,subspace gaussian mixture model,television sets,training data sparseness,data models,decoding,acoustics,accuracy

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要