Performance Study of a Convolutional Time-Domain Audio Separation Network for Real-Time Speech Denoising

Samuel Sonning, Christian Schuldt, Hakan Erdogan, Scott Wisdom

ICASSP (2020)

Abstract

Time-domain audio separation networks based on dilated temporal convolutions have recently been shown to perform very well compared to methods that are based on a time-frequency representation in speech separation tasks, even outperforming an oracle binary time-frequency mask of the speakers. This paper investigates the performance of such a time-domain network (Conv-TasNet) for speech denoising in a real-time setting, comparing various parameter settings. Most importantly, different amounts of lookahead are evaluated and compared to the baseline of a fully causal model. We show that a large part of the increase in performance between a causal and non-causal model is achieved with a lookahead of only 20 milliseconds, demonstrating the usefulness of even small lookaheads for many real-time applications.
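To make the notion of lookahead concrete, the following is a minimal sketch of a single dilated 1-D convolution whose receptive field can be shifted toward the future. It is an illustration only and not the Conv-TasNet architecture studied in the paper: the function names `dilated_conv1d` and `lookahead_samples`, the zero padding at the signal edges, and the 16 kHz sample rate in the usage lines are all assumptions made for this example.

```python
def dilated_conv1d(x, weights, dilation=1, lookahead=0):
    """Dilated 1-D convolution that sees at most `lookahead` future samples.

    lookahead=0 gives a fully causal filter; larger values shift the
    receptive field toward the future. Samples outside the signal are
    treated as zero (an assumption of this sketch).
    """
    k = len(weights)
    span = (k - 1) * dilation  # distance between the first and last tap
    if not 0 <= lookahead <= span:
        raise ValueError("lookahead must lie in [0, (k - 1) * dilation]")
    y = []
    for t in range(len(x)):
        acc = 0.0
        for i, w in enumerate(weights):
            # Tap i reads position t + lookahead - span + i * dilation,
            # so the last tap reads exactly t + lookahead.
            idx = t + lookahead - span + i * dilation
            if 0 <= idx < len(x):
                acc += w * x[idx]
        y.append(acc)
    return y


def lookahead_samples(lookahead_ms, sample_rate_hz):
    """Convert a lookahead in milliseconds to a number of samples."""
    return int(round(lookahead_ms * sample_rate_hz / 1000))


# Impulse at index 5 in a 10-sample signal.
x = [0.0] * 10
x[5] = 1.0

causal = dilated_conv1d(x, [1, 1, 1], dilation=2, lookahead=0)
with_lookahead = dilated_conv1d(x, [1, 1, 1], dilation=2, lookahead=2)

# The causal output first reacts at index 5; with a lookahead of 2 samples
# it already reacts at index 3.
first_causal = causal.index(1.0)
first_lookahead = with_lookahead.index(1.0)

# Assuming 16 kHz audio, 20 ms of lookahead corresponds to 320 samples.
n = lookahead_samples(20, 16000)
```

In a stack of such layers the per-layer lookaheads add up, so the total algorithmic delay of a network is the sum over its layers; how the paper distributes that budget across layers is not stated in the abstract.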

Keywords

Speech enhancement, noise reduction, deep learning, convolutional neural networks, time domain
