Modeling the Affective Content of Music with a Gaussian Mixture Model

T. Affective Computing(2015)

引用 61|浏览52
暂无评分
摘要
Modeling the association between music and emotion has been considered important for music information retrieval and affective human computer interaction. This paper presents a novel generative model called acoustic emotion Gaussians (AEG) for computational modeling of emotion. Instead of assigning a music excerpt with a deterministic (hard) emotion label, AEG treats the affective content of music as a (soft) probability distribution in the valence-arousal space and parameterizes it with a Gaussian mixture model (GMM). In this way, the subjective nature of emotion perception is explicitly modeled. Specifically, AEG employs two GMMs to characterize the audio and emotion data. The fitting algorithm of the GMM parameters makes the model learning process transparent and interpretable. Based on AEG, a probabilistic graphical structure for predicting the emotion distribution from music audio data is also developed. A comprehensive performance study over two emotion-labeled datasets demonstrates that AEG offers new insights into the relationship between music and emotion (e.g., to assess the “affective diversity” of a corpus) and represents an effective means of emotion modeling. Readers can easily implement AEG via the publicly available codes. As the AEG model is generic, it holds the promise of analyzing any signal that carries affective or other highly subjective information.
更多
查看译文
关键词
gaussian mixture model,arousal,acoustic emotion gaussians,valence,emotion distribution prediction,gaussian processes,emotion computational modeling,human computer interaction,subjectivity,music information retrieval,fitting algorithm,learning (artificial intelligence),music,information retrieval,probabilistic graphical structure,valence-arousal space,mixture models,affective diversity assessment,aeg,generative model,music affective content modeling,emotion recognition,acoustic signal processing,emotion data characterization,emotion-labeled datasets,probability distribution,affective human computer interaction,audio signal processing,model learning process,music emotion recognition,audio data characterization,probability,emotion perception,probabilistic logic,computational modeling,predictive models,data models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要