Robust Energy-efficient Audio-based Anger Detection System for Noisy Environments.

Fangfang Zhu-Zhou, Diana Tejera-Berengue,Manuel Rosa-Zurera,Manuel Utrilla-Manso,Roberto Gil-Pita

2023 IEEE 19th International Conference on Intelligent Computer Communication and Processing (ICCP)（2023）

引用 0|浏览2

暂无评分

摘要

Emotional speech recognition (ESR) seeks to identify human emotions by capturing voice attributes, which can be invaluable for understanding speakers’ moods, attitudes, and feelings. This study investigates affective computing for anger recognition in audio signals. We aim to develop a robust and efficient model capable of detecting anger across various real-world scenarios by employing feature extraction, and virtual enlargement techniques. The experimental setup includes multiple feature sets with different computational costs, and virtual enlargement strategies to simulate different environmental conditions. Results show that models trained with augmented data outperform those trained with non-augmented data, highlighting the importance of data augmentation techniques in enhancing model performance and generalization. Furthermore, additional experiments were conducted to confirm the robustness of data augmentation techniques, which consistently demonstrates their positive impact on model performance. Finally, the study contributes to the field of ESR, particularly anger recognition, and its potential applications in violence detection and public safety. The findings demonstrate the effectiveness of feature extraction, virtual enlargement, and data augmentation techniques in improving model performance across various environmental conditions, thereby enhancing the ability to identify anger in audio signals.

查看译文

关键词

anger detection,audio processing,computational cost

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要