Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
International Journal of Computer Vision, Issue 1, 2017. arXiv: abs/1602.07332.
Abstract:
Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle the rich content in images for cognitive t...