Toward a practical implementation of exemplar-based noise robust ASR

Jort F. Gemmeke,Antti Hurmalainen,Tuomas Virtanen,Yang Sun

European Signal Processing Conference（2011）

引用 34|浏览18

暂无评分

摘要

In previous work it was shown that, at least in principle, an exemplar-based approach to noise robust ASR is possible. The method, sparse representation based classification (SC), works by modelling noisy speech as a sparse linear combination of speech and noise exemplars. After recovering the sparsest possible linear combination of labelled exemplars, noise robust posterior likelihoods are estimated by using the weights of the exemplars as evidence of the state labels underlying exemplars. Although promising recognition accuracies at low SNRs were obtained, the method was impractical due to its slow execution speed. Moreover, the performance was not as good on noisy speech corrupted by noise types not represented by the noise exemplars. The importance of sparsity was poorly understood, and the influence of the size of the exemplar-dictionary was unclear. In this paper we investigate all these issues, and we show for example that speedups of a factor 28 can be obtained by using modern GPUs, bringing its execution speed within range to practical applications.

查看译文

关键词

graphics processing units,maximum likelihood estimation,signal classification,signal representation,speech recognition,gpu,sc,snr,automatic speech recognition,exemplar-based noise robust asr,noise robust posterior likelihood,noisy speech modelling,sparse linear combination,sparse representation based classification,noise measurement,accuracy,dictionaries,signal to noise ratio,speech

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要