Determined Audio Source Separation with Multichannel Star Generative Adversarial Network

2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP)（2020）

引用 7|浏览12

暂无评分

摘要

This paper proposes a multichannel source separation approach, which uses a star generative adversarial network (StarGAN) to model power spectrograms of sources. Various studies have shown the significant contributions of a precise source model to the performance improvement in audio source separation, which indicates the importance of developing a better source model. In this paper, we explore the potential of StarGAN for modeling source spectrograms and investigate the effectiveness of the StarGAN source model in determined multichannel source separation by incorporating it into a frequency-domain independent component analysis (ICA) framework. The experimental results reveal that the proposed StarGAN-based method outperformed conventional methods that use non-negative matrix factorization (NMF) or a variational autoencoder (VAE) for source spectrogram modeling.

查看译文

关键词

Multichannel audio signal processing,determined source separation,star generative adversarial network (StarGAN),spectrogram modeling,deep generative model

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要