Speaker change detection with privacy-preserving audio cues.

Sree Hari Krishnan Parthasarathi,Mathew Magimai-Doss,Daniel Gatica-Perez,Hervé Bourlard

ICMI-MLMI（2009）

引用 7|浏览47

暂无评分

摘要

ABSTRACTIn this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear prediction residual, characterizing subband spectral information shown to contain speaker information, and characterizing the general shape of the spectrum. Experiments show that the performance of the privacy-sensitive features is comparable or better than that of the state-of-the-art full-band spectral-based features, namely, mel frequency cepstral coefficients, which suggests that socially acceptable ways of recording conversations in real-life is feasible.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要