Speaker Diarization Features: The UPM Contribution to the RT09 Evaluation

IEEE Transactions on Audio, Speech, and Language Processing (2012)

Abstract
The Universidad Politécnica de Madrid proposed two new features for the Rich Transcription Evaluation 2009 (RT09), both of which improve on the results of the baseline system. The first is the intensity channel contribution, a feature related to the location of the speaker. The second is the logarithm of the interpolated fundamental frequency. This is the first time that these features have been applied to the clustering stage of diarization for meetings recorded with multiple distant microphones. Including both features yields a relative improvement over the baseline of 15.36% on the development set and 16.71% on the RT09 set. Considering speaker errors only, the relative improvement is 23% on the development set and 32.83% on the RT09 set.
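As a rough illustration of the second feature, the sketch below computes the logarithm of an interpolated fundamental frequency track. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not say how the interpolation is done, so linear interpolation in the Hz domain across unvoiced frames is assumed here, unvoiced frames are assumed to be marked as 0 or NaN, and the function name `log_interpolated_f0` is hypothetical.

```python
import numpy as np

def log_interpolated_f0(f0_hz):
    """Return log F0 per frame, filling unvoiced frames by linear interpolation.

    Assumption: unvoiced frames are marked with 0 or NaN in the input track.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    voiced = np.isfinite(f0) & (f0 > 0)
    if not voiced.any():
        raise ValueError("no voiced frames to interpolate from")
    idx = np.arange(f0.size)
    # np.interp holds the first/last voiced value constant at the edges
    filled = np.interp(idx, idx[voiced], f0[voiced])
    return np.log(filled)

# Hypothetical F0 track in Hz; zeros mark unvoiced frames.
track = [0.0, 100.0, 0.0, 0.0, 200.0, 0.0]
features = log_interpolated_f0(track)
```

Interpolating before taking the logarithm gives a value for every frame, so the feature can be used alongside frame-level features in a clustering stage without gaps at unvoiced frames.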
Keywords
rt09 set,upm contribution,baseline system,relative improvement,clustering stage,baseline result,development set,rt09 evaluation,new feature,speaker error,speaker diarization features,rich transcription evaluation,universidad polite