Subband autocorrelation features for video soundtrack classification

Acoustics, Speech and Signal Processing（2013）

引用 7|浏览23

暂无评分

摘要

Inspired by the system presented in [1], we have developed novel auditory-model-based features that preserve the fine time structure lost in conventional frame-based features. While the original auditory model is computationally intense, we present a simpler system that runs about ten times faster but achieves equivalent performance. We use these features for video soundtrack classification with the Columbia Consumer Video dataset, showing that the new features alone are roughly comparable to traditional MFCCs, but combining classifiers based on both features achieves a substantial mean Average Precision improvement of 15% over the MFCC baseline.

查看译文

关键词

acoustic signal processing,audio signal processing,cepstral analysis,video signal processing,Columbia consumer video dataset,MFCC baseline,auditory model,auditory-model-based features,conventional frame-based features,equivalent performance,subband autocorrelation features,substantial mean average precision improvement,video soundtrack classification,Acoustic signal processing,Auditory models,Multimedia databases,Video indexing

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要