Speaker Detection Without Models

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING(2005)

引用 18|浏览93
暂无评分
摘要
In order to capture sequential information and to take advantage of extended training data conditions, we developed an algorithm for speaker detection that scores a test segment by comparing it directly to similar instances of that speech in the training data. This non-parametric technique, though at an early stage in its development, achieves error rates close to 1% on the NIST 2001 Extended Data task and performs extremely well in combination with a standard Gaussian Mixture Model system. We also present a new scoring method that significantly improves performance by capturing only positive evidence.
更多
查看译文
关键词
gaussian mixture model,hidden markov models,computer science,learning artificial intelligence,gmm,gaussian processes,sequential analysis,automatic speech recognition,training data,speaker recognition,error rate,loudspeakers,nist
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要