Deep Neural Networks For Anger Detection From Real Life Speech Data

2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW)(2017)

引用 23|浏览182
暂无评分
摘要
There has been a lot of previous work on deep neural networks for automatic speech recognition, however, little emphasis has been placed on an investigation of effective deep learning architectures for anger detection from speech. In this paper, inspired by the state-of-the-art deep learning algorithms, we propose a variant of Deep Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNNs), Convolution Neural Networks (CNNs) with 3 x 3 kernels, and LSTM RNNs combined with CNNs, in conjunction with logmel filter bank features and brute forced low-level-descriptors from the standardised ComParE set for speech anger detection. We extensively evaluate the deep networks on a big reallife speech corpus of 26 970 utterances with utterance-level labels collected from a German voice portal, finding that our proposed neural networks significantly outperform traditional modelling algorithms for speech anger detection.
更多
查看译文
关键词
deep neural networks,German voice portal,utterance-level labels,standardised ComParE set,brute forced low-level-descriptors,log-mel filter bank features,LSTM RNN,deep learning architectures,life speech data,big real-life speech corpus,speech anger detection,Convolution Neural Networks,Deep Long Short-Term Memory Recurrent Neural Networks,deep learning algorithms,automatic speech recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要