Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning

Ritwik Giri,Michael L. Seltzer,Jasha Droppo,Dong Yu

IEEE International Conference on Acoustics, Speech and SP（2015）

引用 106|浏览102

暂无评分

摘要

In this paper, we propose two approaches to improve deep neural network (DNN) acoustic models for speech recognition in reverberant environments. Both methods utilize auxiliary information in training the DNN but differ in the type of information and the manner in which it is used. The first method uses parallel training data for multi-task learning, in which the network is trained to perform both a primary senone classification task and a secondary feature enhancement task using a shared representation. The second method uses a parameterization of the reverberant environment extracted from the observed signal to train a room-aware DNN. Experiments were performed on the single microphone task of the REVERB Challenge corpus. The proposed approach obtained a word error rate of 7.8% on the SimData test set, which is lower than all reported systems using the same training data and evaluation conditions, and 27.5% on the mismatched RealData test set, which is lower than all but two systems.

查看译文

关键词

acoustic noise,learning (artificial intelligence),neural nets,reverberation,signal classification,speech processing,speech recognition,DNN acoustic models,DNN training,REVERB Challenge corpus,SimData test set,auxiliary information,mismatched RealData test set,multitask learning,parallel training data,primary senone classification task,reverberant environments,reverberation,room aware DNN,room aware deep neural network,secondary feature enhancement task,shared representation,single microphone task,speech recognition,Multi-task learning,deep neural network,reverberation,room impulse response

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要