CAAN: Context-Aware attention network for visual question answering

Pattern Recognition(2022)

引用 23|浏览10
暂无评分
摘要
•Introduce contextual information into the VQA task for the first time and propose a context-aware model CAAN.•Employ the positional relationship information between the image regions and the image to obtain a context-enhanced visual representation.•First introduce question contextual information to enhance the question feature representation in VQA.•Reach a significant performance improvement or comparable performance compared with some other state-of-the-art VQA models.
更多
查看译文
关键词
Visual question answering,Attention mechanism,Understanding bias,Absolute position,Contextual information
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要