Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge

Thanh T Vu,Benjamin Bigot,Eng Siong Chng

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)（2015）

引用 14|浏览17

暂无评分

摘要

In this paper we present our contribution to the third CHiME challenge on speech separation and recognition for noisy multi-channel recordings. The use-case of the challenge consists in single speaker utterances recorded in highly non-stationary noisy environments using a 6-microphone array mounted on a tablet computer. The front-end of our system is performing speech enhancement by cascading a cross-correlation-based channel selection, Signal Dependent MVDR beamforming and online source separation based on sparse NMF. The back-end module is a state-of-the-art speech recognition system with DNN acoustic models trained on fMLLR features and a RNN Language Model. Our system reaches an overall WER of 11.94% on real test recordings, achieving a relative improvement of 65% compared to the baseline system.

查看译文

关键词

Speech Enhancement,Automatic Speech Recognition,MVDR Beamforming,Non Negative Matrix Factorization,CHiME challenge

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要