Revisiting Image-Language Networks for Open-Ended Phrase Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence(2022)

引用 26|浏览156
暂无评分
摘要
Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to an image and localize the phrase. This can also be viewed as a generalization of object ...
更多
查看译文
关键词
Task analysis,Grounding,Visualization,Feature extraction,Benchmark testing,Detectors,Vocabulary
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要