Automatic Audio Classification and Speaker Identification for Video Content Analysis

Shu-Chang Liu,Jing Bi,Zhiqiang Jia,Rui Chen,Jie Chen,Minmin Zhou

Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference（2007）

引用 6|浏览4

暂无评分

摘要

Recently, more literatures proposed to apply audio content analysis techniques in content-based video parsing. This paper presents our works on audio classification and speaker identification techniques for video content analysis. Firstly, soundtrack extracted from video stream is partitioned into homogeneous segments using rule and Support Vector Machine(SVM) based classifier. Secondly, fixed-length speech clips randomly selected from speech segments are clustered into several clusters based on spectral clustering techniques. The clustered speech feature datasets initialize and train Gaussian Mixture Model(GMM) for each speaker. Finally, the trained GMMs accomplish speaker identification. Experimental results confirm the validity of the proposed scheme.

查看译文

关键词

spectral clustering,speech segmentation,support vector machines,speaker recognition,support vector machine,gaussian mixture model,gaussian processes

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要