谷歌浏览器插件
订阅小程序
在清言上使用

Few-Shot Bioacoustics Event Detection Using Transductive Inference with Data Augmentation

IEEE sensors letters(2024)

引用 0|浏览4
暂无评分
摘要
Few-shot (FS) sound event detection (SED) is the process of identifying and recognizing specific sounds or events within an audio recording, specifically in the field of bioacoustics. This task is especially challenging due to the substantial unpredictability, complexity of the sounds, and the limited amount of labeled training data available. We recommend a transudative learning technique with data augmentation in response to the recent success of transudative inference (TI) in the field of computer vision. It aims to make predictions about new, unseen data by maximizing the mutual information between features of the labeled training data and the features of the unlabeled test data. We use the detection and classification of acoustic scenes and events (DCASE)-2021 and DCASE-2022 official datasets from the DCASE to do FS bioacoustics event detection experiments. Research findings illustrate that the recommended approach using TI along with spectrogram augmentation improves the performance of the FS bioacoustics event detection system. The proposed method improves the F-score by 0.10 and 0.16 on DCASE-2021 and DCASE-2022, respectively, compared with the average baseline and state-of-the-art methods.
更多
查看译文
关键词
Sensor systems,data augmentation,few-shot (FS),sound event detection (SED),transudative inference (TI)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要