谷歌浏览器插件
订阅小程序
在清言上使用

Remixing Music with Visual Conditioning

2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020)(2020)

引用 1|浏览10
暂无评分
摘要
We propose a visually conditioned music remixing system by incorporating deep visual and audio models. The method is based on a state of the art audio-visual source separation model which performs music instrument source separation with video information. We modified the model to work with user-selected images instead of videos as visual input during inference to enable separation of audio-only content. Furthermore, we propose a remixing engine that generalizes the task of source separation into music remixing. The proposed method is able to achieve improved audio quality compared to remixing performed by the separate-and-add method with a state-of-the-art audio-visual source separation model.
更多
查看译文
关键词
remixing music,deep visual models,audio models,music instrument source separation,video information,user-selected images,audio-only content,audio quality,audio-visual source separation model,visually conditioned music remixing system,deep audio models,separate-and-add method
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要