Graph-based semi-supervised acoustic modeling in DNN-based speech recognition

SLT(2014)

引用 13|浏览21
暂无评分
摘要
This paper describes the combination of two recent machine learning techniques for acoustic modeling in speech recognition: deep neural networks (DNNs) and graph-based semi-supervised learning (SSL). While DNNs have been shown to be powerful supervised classifiers and have achieved considerable success in speech recognition, graph-based SSL can exploit valuable complementary information derived from the manifold structure of the unlabeled test data. Previous work on graph-based SSL in acoustic modeling has been limited to frame-level classification tasks and has not been compared to, or integrated with, state-of-the-art DNN/HMM recognizers. This paper represents the first integration of graph-based SSL with DNN based speech recognition and analyzes its effect on word recognition performance. The approach is evaluated on two small vocabulary speech recognition tasks and shows a significant improvement in HMM state classification accuracy as well as a consistent reduction in word error rate over a state-of-the-art DNN/HMM baseline.
更多
查看译文
关键词
hmm state classification accuracy,acoustic modeling,deep neural networks,graph-based semisupervised acoustic modeling,speech recognition,word recognition performance,graph-based semisupervised learning,learning (artificial intelligence),pattern classification,unlabeled test data,semi-supervised learning,supervised classifiers,machine learning techniques,frame-level classification tasks,graph theory,graph-based ssl,dnn-based speech recognition,small vocabulary speech recognition tasks,graph-based learning,hidden markov models,neural nets,semi supervised learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要