Sparse And Structured Visual Attention

Martins Pedro Henrique,Niculae Vlad,Marinho Zita,Martins André

2021 IEEE International Conference on Image Processing (ICIP)（2021）

引用 4|浏览53

暂无评分

摘要

Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attentio...

查看译文

关键词

Visualization,Image processing,Conferences,Knowledge discovery,Task analysis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要