Combined Data Augmentation on EANN to Identify Indoor Anomalous Sound Event

Xiyu Song, Junhan Xiong,Mei Wang, Qingshan Mei, Xiaodong Lin

APPLIED SCIENCES-BASEL(2024)

引用 0|浏览2
暂无评分
摘要
Indoor abnormal sound event identification refers to the automatic detection and recognition of abnormal sounds in an indoor environment using computer auditory technology. However, the process of model training usually requires a large amount of high-quality data, which can be time-consuming and costly to collect. Utilizing limited data has become another preferred approach for such research, but it introduces overfitting issues for machine learning models on small datasets. To overcome this issue, we proposed and validated the framework of combining the offline augmentation of raw audio and online augmentation of spectral features, making the application of small datasets in indoor anomalous sound event identification more feasible. Along with this, an improved two-dimensional audio convolutional neural network (EANN) was also proposed to evaluate and compare the impacts of different data augmentation methods under the framework on the sensitivity of sound event identification. Moreover, we further investigated the performance of four combinations of data augmentation techniques. Our research shows that the proposed combined data augmentation method has an accuracy of 97.4% on the test dataset, which is 10.6% higher than the baseline method. This demonstrates the method's potential in the identification of indoor abnormal sound events.
更多
查看译文
关键词
data augmentation,abnormal indoor sound,acoustic event recognition,Mel frequency cepstral coefficient,convolutional neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要