Content-Aware Local Variability Vector For Speaker Verification With Short Utterance

Liping Chen, Kong Aik Lee, Eng-Siong Chng, Bin Ma, Haizhou Li, Li Rong Dai

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2016)

Abstract

The i-vector has been shown to be very effective for speaker verification with long-duration speech utterances. When test utterances are short, however, content mismatch between the enrollment and test utterances limits the performance of an i-vector system. This paper proposes extracting local session variability vectors for different phonetic classes within an utterance, instead of estimating session variability across the whole utterance as the i-vector does. Computed with the posteriors given by a deep neural network (DNN) trained for phone state classification, each local vector represents the session variability contained in specific phonetic content. Our experiments show that the content-aware local vectors are better at coping with the content mismatch between training and test utterances of short duration in text-independent, text-constrained and text-dependent tasks.
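
As a rough illustration of the idea in the abstract, the sketch below accumulates zeroth- and first-order statistics separately for each phonetic class, weighting every frame by the DNN posterior mass assigned to that class. It is not the authors' extractor: the function name `local_statistics`, the state-to-class mapping `class_of_state`, and the pooling of all phone states of a class into one soft occupancy are assumptions made for illustration, and the step that would turn these statistics into local variability vectors is omitted.

```python
import numpy as np


def local_statistics(features, posteriors, class_of_state, num_classes):
    """Accumulate per-phonetic-class statistics from frame posteriors.

    features:       (T, D) acoustic feature frames
    posteriors:     (T, S) frame-level DNN posteriors over S phone states
    class_of_state: (S,) integer array mapping each phone state to a
                    phonetic class (hypothetical mapping, assumed given)
    num_classes:    number of phonetic classes C

    Returns (zeroth, first) with shapes (C,) and (C, D).
    """
    _, dim = features.shape
    zeroth = np.zeros(num_classes)
    first = np.zeros((num_classes, dim))
    for c in range(num_classes):
        # Soft occupancy of class c in each frame: the summed posterior
        # of all phone states assigned to that class.
        gamma = posteriors[:, class_of_state == c].sum(axis=1)
        zeroth[c] = gamma.sum()          # total soft frame count for class c
        first[c] = gamma @ features      # posterior-weighted feature sum
    return zeroth, first


# Toy usage with random data: 50 frames, 4-dim features, 6 states, 3 classes.
rng = np.random.default_rng(0)
feats = rng.normal(size=(50, 4))
post = rng.random(size=(50, 6))
post /= post.sum(axis=1, keepdims=True)   # each row is a distribution
mapping = np.array([0, 0, 1, 1, 2, 2])
zeroth, first = local_statistics(feats, post, mapping, num_classes=3)
```

Because every state belongs to exactly one class and each posterior row sums to one, the class-level counts add up to the number of frames. For a short utterance, some classes may receive very little occupancy, which is where the content mismatch discussed in the abstract shows up.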

Keywords

content-aware local variability, short-duration utterance, speaker verification
