Graph-based semi-supervised acoustic modeling in DNN-based speech recognition

SLT（2014）

引用 13|浏览21

暂无评分

摘要

This paper describes the combination of two recent machine learning techniques for acoustic modeling in speech recognition: deep neural networks (DNNs) and graph-based semi-supervised learning (SSL). While DNNs have been shown to be powerful supervised classifiers and have achieved considerable success in speech recognition, graph-based SSL can exploit valuable complementary information derived from the manifold structure of the unlabeled test data. Previous work on graph-based SSL in acoustic modeling has been limited to frame-level classification tasks and has not been compared to, or integrated with, state-of-the-art DNN/HMM recognizers. This paper represents the first integration of graph-based SSL with DNN based speech recognition and analyzes its effect on word recognition performance. The approach is evaluated on two small vocabulary speech recognition tasks and shows a significant improvement in HMM state classification accuracy as well as a consistent reduction in word error rate over a state-of-the-art DNN/HMM baseline.

查看译文

关键词

hmm state classification accuracy,acoustic modeling,deep neural networks,graph-based semisupervised acoustic modeling,speech recognition,word recognition performance,graph-based semisupervised learning,learning (artificial intelligence),pattern classification,unlabeled test data,semi-supervised learning,supervised classifiers,machine learning techniques,frame-level classification tasks,graph theory,graph-based ssl,dnn-based speech recognition,small vocabulary speech recognition tasks,graph-based learning,hidden markov models,neural nets,semi supervised learning

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要