Nuts and Flakes: a Study of Data Characteristics in Speaker Diarization

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference(2006)

引用 40|浏览15
暂无评分
摘要
Researchers in the speaker diarization community have observed that some audio files show unusually high diarization error rates (DER) (hard to crack "nuts"), and some exhibit hyper-sensitivity to tuning parameters ("flakes"). The goal of this study is to systematically study the features that correlate with such behavior. We calculated over forty features for each of 24 shows from the broadcast news corpus along the dimensions of speaker count, conversation turn, and speaker and show duration. We observed that number of speakers, number of turns, and do-nothing DER (a measure related to the percentage of time the dominant speaker spoke) correlated best with "nuttiness". The do-nothing DER and number of speakers were also the best correlates of "flakiness"
更多
查看译文
关键词
speech processing,broadcast news corpus,data characteristics,diarization error rates,flakes,nuts,speaker diarization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要