Combining Multiple Cues for Visual Madlibs Question Answering

International Journal of Computer Vision, pp. 1-23, 2018.

Cited by: 9|Bibtex|Views117|DOI:https://doi.org/10.1007/s11263-018-1096-0
EI
Other Links: link.springer.com|dblp.uni-trier.de|academic.microsoft.com|arxiv.org

Abstract:

This paper presents an approach for answering fill-in-the-blank multiple choice questions from the Visual Madlibs dataset. Instead of generic and commonly used representations trained on the ImageNet classification task, our approach employs a combination of networks trained for specialized tasks such as scene recognition, person activity...More

Code:

Data:

Your rating :
0

 

Tags
Comments