Speaker Diarization: A Perspective On Challenges And Opportunities From Theory To Practice

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017

Abstract
This paper discusses some challenges and opportunities in developing a speaker diarization system for operation on real-world call center telephony data. We contrast a standard data set, akin to those used in NIST evaluations, with the data found in call centers. In exploring these differences we discovered vulnerabilities and proposed changes to address them. In moving from theory into practice, we introduce two tasks in which speaker diarization and recognition can be leveraged. First, we show that speaker diarization and recognition systems can be integrated to find the common speaker (the call center agent) across multiple calls and, consequently, that speaker's role. Second, once the role is determined, the corresponding speech recognition output can be analyzed to determine the type of support call.
Keywords
Speaker Diarization, Speaker Recognition, Role Modeling, Call Center Data