Sound-event recognition with a companion humanoid

Humanoid Robots(2012)

引用 28|浏览18
暂无评分
摘要
In this paper we address the problem of recognizing everyday sound events in indoor environments with a consumer robot. Sounds are represented in the spectro-temporal domain using the stabilized auditory image (SAI) representation. The SAI is well suited for representing pulse-resonance sounds and has the interesting property of mapping a time-varying signal into a fixed-dimension feature vector space. This allows us to map the sound recognition problem into a supervised classification problem and to adopt a variety of classifications schemes. We present a complete system that takes as input a continuous signal, splits it into significant isolated sounds and noise, and classifies the isolated sounds using a catalogue of learned sound-event classes. The method is validated with a large set of audio data recorded with a humanoid robot in a house. Extended experiments show that the proposed method achieves state-of-the-art recognition scores with a twelve-class problem, while requiring extremely limited memory space and moderate computing power. A first real-time embedded implementation in a consumer robot show its ability to work in real conditions.
更多
查看译文
关键词
audio signal processing,humanoid robots,image representation,robot vision,signal classification,vectors,SAI representation,audio data recording,catalogue,companion humanoid,consumer robot,fixed-dimension feature vector space,humanoid robot,indoor environments,learned sound-event classes,pulse-resonance sound representation,sound-event recognition,spectro-temporal domain,stabilized auditory image representation,supervised classification problem,time-varying signal mapping,twelve-class problem
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要