Acoustic Recognition of Multiple Bird Species Based on Penalized Maximum Likelihood

IEEE Signal Processing Letters (2015)

Abstract
An automatic system for recognition of multiple bird species in audio recordings is presented. Time-frequency segmentation of the acoustic scene is obtained with a sinusoidal detection algorithm, which does not require any estimate of the noise and can handle multiple simultaneous bird vocalizations. Each segment is characterized as a sequence of frequencies over time, referred to as a frequency track. Each bird species is represented by a hidden Markov model that models the temporal evolution of frequency tracks. The number and identity of bird species in a given recording are decided by maximizing the overall likelihood of the set of detected segments, with a penalty applied for each additional bird model used. Experimental evaluations are performed on audio field recordings containing 30 bird species. The presence of multiple bird species is simulated by joining the sets of detected segments from several bird species. Results show that the proposed method achieves recognition performance for multiple bird species not far from that obtained for a single bird species, and that it considerably outperforms majority voting methods.
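
The penalized decision described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the per-segment HMM log-likelihoods are taken as already computed and stored in a hypothetical matrix `loglik[segment][model]`, each segment is scored by the best model in the chosen set, the penalty is a fixed constant per model, and the search over model sets is a simple greedy forward selection.

```python
from typing import List, Sequence, Set, Tuple


def penalized_score(
    loglik: Sequence[Sequence[float]], chosen: Set[int], penalty: float
) -> float:
    """Overall log-likelihood of all segments under the chosen models,
    minus a fixed penalty for every model used (an assumed penalty form)."""
    # Assumption: each segment is explained by its best-scoring chosen model.
    total = sum(max(row[m] for m in chosen) for row in loglik)
    return total - penalty * len(chosen)


def select_species(
    loglik: Sequence[Sequence[float]], penalty: float
) -> Tuple[List[int], float]:
    """Greedy forward selection of species models (an assumed search strategy).

    Adds one model at a time, keeping the addition only while the
    penalized score strictly improves.
    """
    n_models = len(loglik[0])
    chosen: Set[int] = set()
    best = float("-inf")
    while len(chosen) < n_models:
        # Score every model that is not yet in the chosen set.
        candidates = [
            (penalized_score(loglik, chosen | {m}, penalty), m)
            for m in range(n_models)
            if m not in chosen
        ]
        score, model = max(candidates)
        if score <= best:
            break  # the extra model does not pay for its penalty
        chosen.add(model)
        best = score
    return sorted(chosen), best
```

In this sketch, a larger `penalty` favors explaining all segments with fewer species models, while a smaller one lets additional species in whenever they raise the likelihood of some segments.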
Keywords
Markov processes, acoustic signal processing, audio recording, bioacoustics, maximum likelihood estimation, time-frequency analysis, acoustic recognition, audio field recordings, frequency track, hidden Markov model, multiple bird species, multiple simultaneous bird vocalizations, penalized maximum likelihood, sinusoidal detection algorithm, time-frequency segmentation, BIC, bird species recognition, hidden Markov models, maximum likelihood, partition, penalization, sinusoid detection, noise, time frequency analysis, acoustics