Time-Frequency-Bin-Wise Switching Of Minimum Variance Distortionless Response Beamformer For Underdetermined Situations

Kouei Yamaoka, Nobutaka Ono, Shoji Makino, Takeshi Yamada

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019


Abstract

In this paper, we present a speech enhancement method using two microphones in underdetermined situations. Time-frequency (TF) binary masking is a conventional method of enhancing speech in underdetermined situations by appropriately multiplying each TF component by zero or one. Extending this method, we previously proposed a new method called the time-frequency-bin-wise switching (TFS) beamformer. In this method, we switch multiple preconstructed beamformers in each TF bin, each of which suppresses a particular interferer. However, this method requires the pre-estimation of beamformer filter coefficients using the target-active period and interferer-wise-active periods as the prior information. In this paper, to overcome this limitation, we formulate the switching and construction of spatial filters as a joint optimization problem, which can be understood from two viewpoints: the clustering of the most dominant interferer signal in each TF bin and the construction of a minimum variance distortionless response beamformer using such bins. In an experiment, we confirmed that the proposed method was superior to conventional TF masking and fixed beamforming during speech enhancement regardless of the direction of interferers.
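
The two viewpoints in the abstract (clustering TF bins by dominant interferer, then building an MVDR beamformer from each cluster) suggest an alternating procedure. The following is a minimal NumPy sketch of that idea, not the authors' implementation. It assumes the target steering vector `steer` is known for every frequency, that switching is decided by picking the beamformer with the smallest output power in each bin, and that a small diagonal loading `eps` is acceptable; the names `mvdr_weights` and `tfs_mvdr` are hypothetical.

```python
import numpy as np

def mvdr_weights(cov, steer, eps=1e-6):
    """MVDR weights w = R^{-1} a / (a^H R^{-1} a), with diagonal loading (assumption)."""
    m = cov.shape[0]
    cov = cov + eps * np.trace(cov).real / m * np.eye(m)
    num = np.linalg.solve(cov, steer)
    return num / (steer.conj() @ num)

def tfs_mvdr(X, steer, n_beamformers=2, n_iter=10, seed=0):
    """Alternate between bin clustering and MVDR construction.

    X:     complex STFT, shape (F, T, M) = (frequencies, frames, microphones)
    steer: assumed-known target steering vectors, shape (F, M)
    Returns the enhanced STFT (F, T) and the per-bin beamformer index (F, T).
    """
    F, T, M = X.shape
    rng = np.random.default_rng(seed)
    labels = rng.integers(n_beamformers, size=(F, T))  # random initial clustering
    Y = np.zeros((F, T), dtype=complex)
    for f in range(F):
        Xf = X[f]  # (T, M), one row per frame
        for _ in range(n_iter):
            # Step 1: one MVDR beamformer per cluster of frames.
            W = np.empty((n_beamformers, M), dtype=complex)
            for k in range(n_beamformers):
                sel = Xf[labels[f] == k]
                if len(sel) == 0:  # empty cluster: fall back to all frames
                    sel = Xf
                cov = sel.T @ sel.conj() / len(sel)  # average of x x^H
                W[k] = mvdr_weights(cov, steer[f])
            # Step 2: reassign each frame to the beamformer with least output power.
            out = Xf @ W.conj().T  # (T, K), entries w_k^H x_t
            new = np.argmin(np.abs(out) ** 2, axis=1)
            if np.array_equal(new, labels[f]):
                break
            labels[f] = new
        Y[f] = out[np.arange(T), labels[f]]
    return Y, labels
```

Because every beamformer satisfies the distortionless constraint toward `steer`, the target component passes unchanged whichever beamformer is selected, so choosing the minimum-power output in each bin only affects the residual interference. With two microphones each beamformer can null one interferer, which is why switching per bin can help when there are more interferers than a single beamformer can suppress.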


Keywords

beamforming, time-frequency masking, speech enhancement, underdetermined situation, nonlinear signal processing
