Directional Sound-Capture System With Acoustic Array Based on FPGA

Weiming Xiang,Yu Liu, Yiwei Zhou,Yu Wu

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT(2024)

引用 0|浏览1
暂无评分
摘要
The front-end speech enhancement system is regarded as an essential component for maximizing the performances of smart technology for voice interaction in complicated live acoustic scenes. Existing research has had the following limitations: short-distance detection, poor suppression of nonstationary interferers, and inaccurate estimation of the direction of arrival. To tackle these issues, this article proposes a 48-channel acoustic array system for directional sound capture (DSC). This system implements a field-programmable gate array (FPGA)-based acquisition and signal processing algorithm: broadband acoustic beamformer based on audio-visual (A-V). To the authors' knowledge, this is the first time that a DSC system that uses A-V for terminal voice interaction has been implemented by FPGA. Experiments were set up in diverse acoustic scenes to evaluate the system's performance. The results imply that the proposed system can be widely applied to smart scenes in complicated acoustic environments contaminated with intense background noise and competing nonstationary interferers, as well as provide real-time speech recognition and classification.
更多
查看译文
关键词
Acoustics,Array signal processing,Acoustic arrays,Field programmable gate arrays,Direction-of-arrival estimation,Signal processing algorithms,Broadband communication,Acoustic array system,audio-visual (A-V),broadband acoustic beamformer,directional sound capture (DSC),field-programmable gate array (FPGA)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要