Fully Automatic Speaker Separation System, with Automatic Enrolling of Recurrent Speakers.

Raphael Cohen, Orgad Keller, Jason Levy, Russell Levy, Micha Breakstone, Amit Ashkenazi

Interspeech (2018)

Abstract
We present a system for speaker separation and identification that requires no effort from the end user. The system transforms single-channel conversations into i-vectors, clusters them by speaker, and matches the clusters against a database of known speakers. Enrollment is automatic: a voice print is constructed for the recording user by taking advantage of the metadata identifying that user's conversations. When available, additional information sources such as video and ASR-transcribed content are also used to identify speakers. We describe the system architecture and a novel unsupervised enrollment algorithm, and discuss the difficulties encountered in solving this problem.
Keywords
speaker separation, diarization, speech recognition
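
The cluster-then-match step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the cosine-similarity scoring, the centroid-based agglomerative merging, the 0.8 thresholds, and the toy 2-D vectors are all assumptions chosen for illustration. Real i-vectors have hundreds of dimensions and come from a trained extractor, which is not shown here, and the paper's enrollment algorithm is not reproduced.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def centroid(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cluster_segments(vectors, threshold=0.8):
    """Hypothetical agglomerative clustering of per-segment vectors.

    Repeatedly merges the two clusters whose centroids are most similar,
    stopping when no pair reaches `threshold`. Returns lists of segment indices.
    """
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > 1:
        best_score, best_pair = threshold, None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                ci = centroid([vectors[k] for k in clusters[i]])
                cj = centroid([vectors[k] for k in clusters[j]])
                score = cosine(ci, cj)
                if score >= best_score:
                    best_score, best_pair = score, (i, j)
        if best_pair is None:
            break  # no remaining pair is similar enough to merge
        i, j = best_pair
        clusters[i].extend(clusters.pop(j))
    return clusters


def identify(cluster_centroid, voice_prints, threshold=0.8):
    """Return the best-matching enrolled speaker name, or None if unknown."""
    best_name, best_score = None, threshold
    for name, print_vector in voice_prints.items():
        score = cosine(cluster_centroid, print_vector)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Toy data (assumed): four segments from two speakers, one enrolled voice print.
segments = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
voice_prints = {"alice": [1.0, 0.0]}

clusters = cluster_segments(segments)
labels = [
    identify(centroid([segments[k] for k in members]), voice_prints)
    for members in clusters
]
```

In this sketch a cluster that matches no stored voice print is labeled `None`; such a cluster would be a candidate for automatic enrollment, though how the system actually decides that is not specified in the abstract.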