Simplified VTS-based I-vector extraction in noise-robust speaker recognition

ICASSP(2014)

引用 26|浏览84
暂无评分
摘要
A vector taylor series (VTS) based i-vector extractor was recently proposed for noise-robust speaker recognition by extracting synthesized clean i-vectors to be used in the standard system back-end. This approach brings significant improvements in accuracy for noisy speech conditions. However, this approach incurred such a large computational expense that using the state-of-the-art model size or evaluating large scale evaluations was impractical. In this work, we propose an efficient simplification scheme, named sVTS, in order to show that the VTS approach gives improvements in large scale applications compared to state-of-the-art systems. In contrast to VTS, sVTS generates normalized Baum-Welch statistics and uses the standard i-vector model, making it straightforward to employ on the state-of-the-art i-vector speaker recognition system. Results presented on both the PRISM and the large NIST SRE'12 corpora show that using sVTS i-vectors provides significant improvements in the noisy conditions, and that our proposed simplification result in only a slight degradation with respect to the original VTS approach.
更多
查看译文
关键词
prism,i-vector,normalized baum-welch statistics,statistics,vector taylor series,simplified vts-based i-vector extraction,nist sre'12 corpora,speech synthesis,speaker recognition,noise-robust speaker recognition system,noise compensation,vectors,noisy speaker verification,nist,speech,computational modeling,noise measurement,noise
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要