Korean Speech Recognition Using Deep Learning

Suji Lee, Seokjin Han, Sewon Park, Kyeongwon Lee, Jaeyong Lee

KOREAN JOURNAL OF APPLIED STATISTICS (2019)

Abstract

In this paper, we propose an end-to-end deep learning model for Korean speech recognition that incorporates a Bayesian neural network. In the past, Korean speech recognition was a complicated task because its many intermediate steps involved an excessive number of parameters and required expert knowledge of Korean. Recent breakthroughs in end-to-end models have made the task more manageable. An end-to-end model decodes mel-frequency cepstral coefficients directly into text without any intermediate processing. In particular, models trained with the Connectionist Temporal Classification (CTC) loss and attention-based models are both end-to-end approaches. In addition, we use a Bayesian neural network to implement the end-to-end model and obtain Monte Carlo estimates. Finally, we carry out experiments on the "WorimalSam" online dictionary dataset. We obtain a word error rate (WER) of 4.58%, an improvement over the Google and Naver APIs.
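
The word error rate reported in the abstract is conventionally the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the authors' scoring code; the function name `word_error_rate` and the whitespace tokenization are assumptions, since the abstract does not say how Korean text was split into words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace. The paper's actual
    tokenization of Korean text is not stated in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                       # deletion
                curr[j - 1] + 1,                   # insertion
                prev[j - 1] + substitution_cost,   # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a hypothesis that substitutes one word and drops another from a four-word reference has a WER of 2/4 = 0.5. A corpus-level figure such as the 4.58% above is normally obtained by summing edit counts and reference lengths over all utterances rather than averaging per-utterance rates, though the abstract does not state which was used.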

Keywords

Korean speech recognition, end-to-end deep learning, Connectionist temporal classification, Attention, Bayesian deep learning
