A Lecture Transcription System Combining Neural Network Acoustic And Language Models

Peter Bell, Hitoshi Yamamoto, Pawel Swietojanski, Youzheng Wu, Fergus McInnes, Chiori Hori, Steve Renals

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (2013)

Abstract

This paper presents a new system for automatic transcription of lectures. The system combines a number of novel features, including deep neural network acoustic models using multi-level adaptive networks to incorporate out-of-domain information, and factored recurrent neural network language models. We demonstrate that the system achieves large improvements on the TED lecture transcription task from the 2012 IWSLT evaluation: our results are currently the best reported on this task, showing a relative WER reduction of more than 16% compared to the closest competing system from the evaluation.

Keywords

large vocabulary speech recognition, lecture transcription, deep neural networks, MLAN, factored RNN language model
