Cnn-Based Virtual Microphone Signal Estimation For Mpdr Beamforming In Underdetermined Situations

2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)(2019)

引用 3|浏览45
暂无评分
摘要
In this paper, we propose a novel approach to virtually increasing the number of microphone elements between two real microphones to improve speech enhancement performance in underdetermined situations. The virtual microphone technique, with which virtual signals in the audio signal domain are estimated by linearly interpolating the phase and nonlinearly interpolating the amplitude independently on the basis of beta-divergence, has been recently proposed and experimentally shown to be effective in improving speech enhancement performance. Furthermore, it has been reported that the performance tends to improve as the nonlinearity is improved. However, one drawback of this method is that the interpolation is employed in each time-frequency bin independently, in which the spectral and temporal structures of speech signals are ignored. To address this problem and improve the nonlinearity, motivated by the high capability of neural networks to model nonlinear functions and speech spectrograms, in this paper, we propose an alternative method of amplitude interpolation. In this method, we employ a convolutional neural network as an amplitude estimator that minimizes the mean squared error between the outputs of a minimum power distortionless response (MPDR) beamformer and the target speech signals. The experimental results revealed that the proposed method showed high potential for improving speech enhancement performance, which was not only superior to that of the conventional virtual microphone technique but also the performance in the corresponding determined situation.
更多
查看译文
关键词
nonlinear functions,speech spectrograms,amplitude interpolation,amplitude estimator,minimum power distortionless response beamformer,MPDR,target speech signals,speech enhancement performance,virtual microphone technique,CNN-based virtual microphone signal estimation,microphone elements,microphones,virtual signals,audio signal domain,linearly interpolating,nonlinearity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要