Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition
INTERSPEECH(2019)
摘要
Conventionally, speech emotion recognition is achieved using passive learning approaches. Differing from such approaches, we herein propose and develop a dynamic method of autonomous emotion learning based on zero-shot learning. The proposed methodology employs emotional dimensions as the attributes in the zero-shot learning paradigm, resulting in two phases of learning, namely attribute learning and label learning. Attribute learning connects the paralinguistic features and attributes utilising speech with known emotional labels, while label learning aims at defining unseen emotions through the attributes. The experimental results achieved on the CINEMO corpus indicate that zero-shot learning is a useful technique for autonomous speech-based emotion learning, achieving accuracies considerably better than chance level and an attribute-based gold-standard setup. Furthermore, different emotion recognition tasks, emotional attributes, and employed approaches strongly influence system performance.
更多查看译文
关键词
Autonomous emotion learning, speech emotion recognition, zero-shot learning, emotional attributes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络