MAAS: Multi-modal Assignation for Active Speaker Detection

Cited by: 0|Views8

Abstract:

Active speaker detection requires a solid integration of multi-modal cues. While individual modalities can approximate a solution, accurate predictions can only be achieved by explicitly fusing the audio and visual features and modeling their temporal progression. Despite its inherent muti-modal nature, current methods still focus on mo...More

Code:

Data:

Full Text
Bibtex
Your rating :
0

 

Tags
Comments