Novel Teager Energy Based Subband Features For Audio Acoustic Scene Detection And Classification

PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II(2019)

引用 0|浏览19
暂无评分
摘要
Acoustic Scene Classification (ASC) is the task of assigning a semantic label for a given audio sample recorded in different acoustic environments. Sounds carry a significant information about everyday environment scenes, such as bus, tram, airport, concert hall, etc. Thus, extracting the sound signals of these acoustic scenes can be useful to detect and classify the audio signals. In this context, Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 challenge provides a common framework for researchers to propose various approaches with an aim to extract this information present in different acoustical environments. In this paper, to capture the discriminative information between different acoustic scenes, Teager energies with both mel and linear scales are used. These are computed by applying Teager Energy Operator (TEO) on a narrowband filtered signal and are modeled with convolutional neural network (CNN) for detecting and classifying the acoustic scenes or events. The results obtained on the development set gave an overall accuracy of 67.3% using recommended cross-validation setup and thus, overcoming the performance of baseline by 6.3%.
更多
查看译文
关键词
Acoustic Scene Classification (ASC), Convolutional neural network (CNN), Teager Energy Operator (TEO)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要