Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Abstract:
Image-language matching tasks have recently attracted a lot of attention in the computer vision field. These tasks include image-sentence matching, i.e., given an image query, retrieving relevant sentences and vice versa, and region-phrase matching or visual grounding, i.e., matching a phrase to relevant regions. This paper investigates...More
Code:
Data:
Tags
Comments