ASR - VLSP 2021: Automatic Speech Recognition with Blank Label Re-weighting

Ta Bao Thang, Dang Dinh Son, Le Dang Linh,Dang Xuan Vuong,Duong Quang Tien

VNU Journal of Science: Computer Science and Communication Engineering(2022)

引用 0|浏览1
暂无评分
摘要
End-to-end models have significant potential in most languages and recently proved the robustness in ASR tasks. Many robust architectures are proposed, and among many techniques, Recurrent Neural Network - Transducer (RNN-T) shows remarkable success. However, with background noise or reverb in spontaneous speech, this architecture generally suffers from high deletion error problems. For this reason, we propose the blank label re-weighting technique to improve the state-of-the-art Conformer transducer model. Our proposed system adopts the Stochastic Weight Averaging approach, stabilizing the training process. Our work achieved the first rank with a 4.17% of word error rate in Task 2 of the VLSP 2021 Competition.
更多
查看译文
关键词
automatic speech recognition,vlsp,re-weighting
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要