Sparse And Structured Visual Attention

2021 IEEE International Conference on Image Processing (ICIP), 2021

Abstract
Visual attention mechanisms are widely used in multimodal tasks such as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attentio...
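
The drawback named in the abstract, that softmax gives every image region some probability mass, can be seen in a small numerical sketch. The region scores below are made-up illustrative values, not data from the paper, and the helper `softmax` is written here for illustration only. Because the abstract is truncated, the sketch does not attempt to show the replacement attention the authors propose.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of real-valued scores."""
    m = max(scores)                       # subtract the max to avoid overflow
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical relevance scores for five image regions (assumed values).
region_scores = [4.0, 3.5, 0.0, -2.0, -6.0]
weights = softmax(region_scores)

for i, w in enumerate(weights):
    print(f"region {i}: weight = {w:.6f}")

# Every weight is strictly positive, even for clearly irrelevant regions,
# and each weight depends only on its own score, not on which regions are adjacent.
print("all weights > 0:", all(w > 0 for w in weights))
```

Even the lowest-scoring region keeps a small but nonzero weight, which is the behaviour the abstract identifies as a drawback.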
Keywords
Visualization,Image processing,Conferences,Knowledge discovery,Task analysis