Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Diego Castán,Alfonso Ortega,Antonio Miguel,Eduardo Lleida

EURASIP Journal on Audio, Speech, and Music Processing（2014）

引用 22|浏览66

暂无评分

摘要

This paper studies a novel audio segmentation-by-classification approach based on factor analysis. The proposed technique compensates the within-class variability by using class-dependent factor loading matrices and obtains the scores by computing the log-likelihood ratio for the class model to a non-class model over fixed-length windows. Afterwards, these scores are smoothed to yield longer contiguous segments of the same class by means of different back-end systems. Unlike previous solutions, our proposal does not make use of specific acoustic features and does not need a hierarchical structure. The proposed method is applied to segment and classify audios coming from TV shows into five different acoustic classes: speech, music, speech with music, speech with noise, and others. The technique is compared to a hierarchical system with specific acoustic features achieving a significant error reduction.

查看译文

关键词

Audio segmentation,Factor analysis,Within-class variability compensation,Broadcast news,Albayzin 2010 evaluation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要