BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution
CoRR(2023)
摘要
Speech bandwidth extension (BWE) has demonstrated promising performance in
enhancing the perceptual speech quality in real communication systems. Most
existing BWE researches primarily focus on fixed upsampling ratios,
disregarding the fact that the effective bandwidth of captured audio may
fluctuate frequently due to various capturing devices and transmission
conditions. In this paper, we propose a novel streaming adaptive bandwidth
extension solution dubbed BAE-Net, which is suitable to handle the
low-resolution speech with unknown and varying effective bandwidth. To address
the challenges of recovering both the high-frequency magnitude and phase speech
content blindly, we devise a dual-stream architecture that incorporates the
magnitude inpainting and phase refinement. For potential applications on edge
devices, this paper also introduces BAE-NET-lite, which is a lightweight,
streaming and efficient framework. Quantitative results demonstrate the
superiority of BAE-Net in terms of both performance and computational
efficiency when compared with existing state-of-the-art BWE methods.
更多查看译文
关键词
adaptive bandwidth extension,low-complexity,magnitude inpainting,phase refinement
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要