End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network

Mengna Gao,Jing Dong,Dongsheng Zhou,Qiang Zhang,Deyun Yang

Proceedings of the 2019 3rd International Conference on Innovation in Artificial Intelligence（2019）

引用 10|浏览11

暂无评分

摘要

Real-time speech emotion recognition has always been a problem. To this end, we proposed an end-to-end speech emotion recognition model based on one-dimensional convolutional neural network, which contains only three convolution layers, two pooling layers and one full-connected layer. Through Adam optimization algorithm and back propagation mechanism, more discriminative features can be extracted continuously. Our model is quite simple in structure and easy to quickly complete the emotional classification task. Compared with traditional methods, there is no need to carry out the complex process of manually extracting features, and the model can automatically learn the emotional features from raw speech signals. In the emotional recognition experiments with EMODB, CASIA, IEMOCAP, and CHEAVD four speech databases, relatively high recognition rates were obtained. Experiments show that the proposed algorithm is of great benefit to the implementation of real-time speech emotion recognition.

查看译文

关键词

Convolutional Neural Network, End-to-End, Speech Emotion Recognition

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要