VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

Hannah Muckenhirn
Prashant Sridhar
Zelin Wu
Rif A. Saurous
Ye Jia
Ignacio Lopez-Moreno

arXiv: Audio and Speech Processing, Volume abs/1810.04826, 2019, Pages 2728-2732.

DOI: https://doi.org/10.21437/Interspeech.2019-1101

Abstract:

In this paper, we present a novel system that separates the voice of a target speaker from multi-speaker signals by making use of a reference signal from the target speaker. We achieve this by training two separate neural networks: (1) a speaker recognition network that produces speaker-discriminative embeddings; (2) a spectrogram maskin...
