Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

International Journal of Computer Vision, Volume abs/1602.07332, Issue 1, 2017.

被引用1491|引用|浏览270|DOI:https://doi.org/10.1007/s11263-016-0981-7
EI
其它链接dblp.uni-trier.de|dl.acm.org|link.springer.com|academic.microsoft.com|arxiv.org

摘要

Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle the rich content in images for cognitive t...更多

代码

数据

下载 PDF 全文
您的评分 :
0

 

标签
评论