Effectiveness in open-set speaker identification

ICCST(2014)

引用 17|浏览9
暂无评分
摘要
This paper presents investigations into the relative effectiveness of two alternative approaches to open-set text-independent speaker identification (OSTI-SI). The methods considered are the recently introduced i-vector and the more traditional GMM-UBM method supported by score normalisation. The study is motivated by the growing need for effective extraction of intelligence and evidence from audio recordings in the fight against crime. OSTI-SI is known to be the most challenging subclass of speaker recognition, and its adoption in criminal investigation applications is further complicated by undesired variations in speech characteristics due to changing levels of environmental noise. In this study, the experimental investigations are conducted using a protocol developed for the identification task, based on the NIST speaker recognition evaluation corpus of 2008. In order to closely cover relevant conditions in the considered application areas and investigate the identification performance in such scenarios, the speech data is contaminated with a range of real-world noise. The paper provides a detailed description of the experimental study and presents a thorough analysis of the results.
更多
查看译文
关键词
gaussian processes,audio recording,mixture models,speaker recognition,speech processing,gmm-ubm method,nist speaker recognition evaluation corpus,osti-si,audio recordings,criminal investigation applications,i-vector,open-set speaker identification,open-set text-independent speaker identification,score normalisation,speech characteristics,gmm-ubm,nist,white noise,contamination,signal to noise ratio,speech,accuracy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要