Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling
IROS, pp. 4848-4854, 2020.
Detecting sound source objects within visual observation is important for autonomous robots to comprehend surrounding environments. Since sounding objects have a large variety with different appearances in our living environments, labeling all sounding objects is impossible in practice. This calls for self-supervised learning which does...More
PPT (Upload PPT)