Speaker Detection Without Models

Daniel Gillick,Stephen Stafford,Barbara Peskin

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING（2005）

引用 18|浏览93

暂无评分

摘要

In order to capture sequential information and to take advantage of extended training data conditions, we developed an algorithm for speaker detection that scores a test segment by comparing it directly to similar instances of that speech in the training data. This non-parametric technique, though at an early stage in its development, achieves error rates close to 1% on the NIST 2001 Extended Data task and performs extremely well in combination with a standard Gaussian Mixture Model system. We also present a new scoring method that significantly improves performance by capturing only positive evidence.

查看译文

关键词

gaussian mixture model,hidden markov models,computer science,learning artificial intelligence,gmm,gaussian processes,sequential analysis,automatic speech recognition,training data,speaker recognition,error rate,loudspeakers,nist

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要