谷歌浏览器插件
订阅小程序
在清言上使用

Korean Speech Recognition Using Deep Learning

Suji Lee, Seokjin Han,Sewon Park,Kyeongwon Lee,Jaeyong Lee

KOREAN JOURNAL OF APPLIED STATISTICS(2019)

引用 0|浏览3
暂无评分
摘要
In this paper, we propose an end-to-end deep learning model combining Bayesian neural network with Korean speech recognition. In the past, Korean speech recognition was a complicated task due to the excessive parameters of many intermediate steps and needs for Korean expertise knowledge. Fortunately, Korean speech recognition becomes manageable with the aid of recent breakthroughs in "End-to-end" model. The end-to-end model decodes mel-frequency cepstral coefficients directly as text without any intermediate processes. Especially, Connectionist Temporal Classification loss and Attention based model are a kind of the end-to-end. In addition, we combine Bayesian neural network to implement the end-to-end model and obtain Monte Carlo estimates. Finally, we carry out our experiments on the "WorimalSam" online dictionary dataset. We obtain 4.58% Word Error Rate showing improved results compared to Google and Naver API.
更多
查看译文
关键词
Korean speech recognition,end to end deep learning,Connectionist temporal classification,Attention,Bayesian deep learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要