Focal Visual-Text Attention for Memex Question Answering
IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-1, 2019.
EI WOS
Abstract:
Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we have to look at whole collections with sequences of photos. This paper proposes a ...More
Code:
Data:
Tags
Comments