Towards Real-Time Single-Channel Singing-Voice Separation with Pruned Multi-Scaled Densenets

Markus Huber,Gunther Schindler,Christian Schorkhuber,Wolfgang Roth,Franz Pernkopf,Holger Froning

ICASSP（2020）

引用 3|浏览14

暂无评分

摘要

Modern musical source separation systems based on deep neural networks reach unprecedented levels of separation quality. However, harnessing the power of these large-scale models in typical audio production environments, which frequently offer only limited computing resources while demanding real-time processing, remains challenging. We extend the multi-scaled DenseNet in several aspects to facilitate real-time source separation scenarios. Specifically, we reduce the computational requirements by inferring Melscaled masks and decrease the model size via effective use of bottleneck layers, while improving performance using a deep clustering objective. In addition, we are able to further increase the model efficiency by applying parameterized structured pruning of convolutional weights without any significant impact on the separation performance. We significantly reduce the model size and increase the computational efficiency by a factor of 1.6 and 4.3, respectively, while maintaining the separation performance.

查看译文

关键词

Musical Source Separation,Real-time,Parameterized Structured Pruning,Multi-scaled DenseNet

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要