UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge.

Zbyněk Zajíc, Marie Kunešová, Marek Hrúz, Jan Vaněk

INTERSPEECH (2019)

Abstract

We present the speaker diarization system developed by the team from the New Technologies for the Information Society (NTIS) research center of the University of West Bohemia in Pilsen for the Second DIHARD Speech Diarization Challenge. The system follows the currently standard pipeline of segmentation, i-vector/x-vector extraction, clustering, and resegmentation. The hyperparameters of each subsystem were selected according to a domain classifier trained on the DIHARD II development set. We compared our system with the Kaldi diarization system (using i-vectors and x-vectors) and combined these systems. At the time of writing, our best submission achieved a diarization error rate (DER) of 23.47% and a Jaccard error rate (JER) of 48.99% on the evaluation set (Track 1, using reference speech activity detection, SAD).

Keywords

speaker diarization, i-vector, x-vector, agglomerative hierarchical clustering, neural network classifier, speaker change detection
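
The clustering stage named in the abstract and keywords can be illustrated with a minimal sketch of agglomerative hierarchical clustering over per-segment speaker embeddings. This is not the authors' implementation: the cosine distance, the average linkage, the stopping threshold, and the toy two-dimensional "embeddings" are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two embedding vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def ahc(embeddings, threshold):
    """Average-linkage AHC; returns one cluster label per segment.

    Merging stops once the closest pair of clusters is farther apart
    than `threshold` (a hypothetical tuning parameter).
    """
    # Start with every segment in its own cluster.
    clusters = [[i] for i in range(len(embeddings))]

    def linkage(c1, c2):
        # Average pairwise distance between members of two clusters.
        dists = [cosine_distance(embeddings[i], embeddings[j])
                 for i in c1 for j in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        # Find the closest pair of clusters.
        best = None
        for p in range(len(clusters)):
            for q in range(p + 1, len(clusters)):
                d = linkage(clusters[p], clusters[q])
                if best is None or d < best[0]:
                    best = (d, p, q)
        d, p, q = best
        if d > threshold:
            break
        # Merge cluster q into cluster p.
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]

    labels = [0] * len(embeddings)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


# Toy embeddings: two segments per "speaker" (illustrative values only).
segments = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
labels = ahc(segments, threshold=0.3)
print(labels)
```

In a sketch like this, the threshold decides how many speakers come out of the clustering, which is one kind of hyperparameter that could plausibly be chosen per domain, as the abstract describes for the system's subsystems.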
