Factor analysis based VTS discriminative adaptive training

ICASSP（2012）

引用 6|浏览20

暂无评分

摘要

Vector Taylor Series (VTS) model based compensation is a powerful approach for noise robust speech recognition. An important extension to this approach is VTS adaptive training (VAT), which allows canonical models to be estimated on diverse noise-degraded training data. These canonical model can be estimated using EM-based approaches, allowing simple extensions to discriminative VAT (DVAT). However to ensure a diagonal corrupted speech covariance matrix the Jacobian (loading matrix) relating the noise and clean speech is diagonalised. In this work an approach for yielding optimal diagonal loading matrices based on minimising the expected KL-divergence between the diagonal loading matrix and “correct” distributions is proposed. The performance of DVAT using the standard and optimal diagonalisation was evaluated on both in-car collected data and the Aurora4 task.

查看译文

关键词

clean speech,loading matrix,jacobian,factor analysis based vts discriminative adaptive training,speech recognition,noise,vector taylor series model based compensation,powerful approach,training,covariance matrices,noise robustness,noise robust speech recognition,em-based approach,in-car collected data,canonical models,noise speech,diagonal corrupted speech covariance matrix,generative processes,optimal diagonal loading matrices,adaptive training,jacobian matrices,kl-divergence,dvat,vectors,aurora4 task,discriminative vat,kl divergence,speech

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要