Recent Progresses on Deep Learning for Speech Recognition

Handbook of Pattern Recognition and Computer Vision (6th Edition)(2020)

引用 0|浏览53
暂无评分
摘要
We discuss two important areas in deep learning based automatic speech recognition (ASR) where significant research attention has been given recently: end-to-end (E2E) modeling and robust ASR. E2E modeling aims at simplifying the modeling pipeline and reducing the dependency on domain knowledge by introducing sequence-to-sequence translation models. These models usually optimize the ASR objectives end-to-end with few assumptions, and can potentially improve the ASR performance when abundant training data is available. Robustness is critical to, but is still less than desired in, practical ASR systems. Many new attempts, such as teacher-student learning, adversarial training, improved speech separation and enhancement, have been made to improve the systems’ robustness. We summarize the recent progresses in these two areas with a focus on the successful technologies proposed and the …
更多
查看译文
关键词
deep learning,speech recognition,recent progresses on
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要