The 2001 GMTK-based SPINE ASR system

Özgür Çetin,Harriet J. Nock,Katrin Kirchhoff,Jeff A. Bilmes,Mari Ostendorf

INTERSPEECH（2002）

引用 27|浏览29

暂无评分

摘要

This paper provides a detailed description of the University of Washington automatic speech recognition (ASR) system for the 2001 DARPA SPeech In Noisy Environments (SPINE) task. Our system makes heavy use of the graphical modeling toolkit (GMTK), a general purpose graphical modeling-based ASR sys- tem that allows arbitrary parameter tying, flexible determinis- tic and stochastic dependencies between variables, and a gener- alized maximum likelihood parameter estimation algorithm. In our SPINE system, GMTK was used for acoustic model training whereas feature extraction, speaker adaptation, and first-pass de- coding were performed by HTK. Our integrated GMTK/HTK sys- tem demonstrates the relative merits provided by each tool. Novel aspects of our SPINE system include the capturing of correlations among feature vectors via a globally-shared factored sparse inverse covariance matrix and generalized EM training.

查看译文

关键词

covariance matrix,graphical model,maximum likelihood,parameter estimation,automatic speech recognition,feature extraction,feature vector

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要