Attention-Driven Cross-Modal Remote Sensing Image Retrieval.

IGARSS (2021)

Cited by 4 | Views 5
Abstract
In this work, we address a cross-modal retrieval problem in remote sensing (RS) data. A cross-modal retrieval problem is more challenging than conventional uni-modal retrieval frameworks, as it requires learning two completely different data representations and mapping them onto a shared feature space. For this purpose, we chose a photo-sketch RS database. We exploit the data modality comprising more spatial information (sketch) to extract the other modality's features (photo) with cross-attention networks. These sketch-attended photo features are more robust and yield better retrieval results. We validate our proposal by performing experiments on the benchmark Earth on Canvas dataset and show a boost in overall performance compared to the existing literature. Besides, we also display Grad-CAM visualizations of the trained model's weights to highlight the framework's efficacy.
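The cross-attention idea in the abstract can be sketched as follows: sketch features serve as queries attending over photo features, producing "sketch-attended" photo features. This is a minimal illustrative NumPy sketch, not the authors' implementation; the function and variable names are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(sketch_feats, photo_feats):
    """Scaled dot-product cross-attention (hypothetical sketch).

    sketch_feats: (Ns, d) sketch-modality feature vectors (queries).
    photo_feats:  (Np, d) photo-modality feature vectors (keys/values).
    Returns (Ns, d) sketch-attended photo features.
    """
    d = sketch_feats.shape[-1]
    scores = sketch_feats @ photo_feats.T / np.sqrt(d)   # (Ns, Np)
    weights = softmax(scores, axis=-1)                   # rows sum to 1
    return weights @ photo_feats                         # (Ns, d)
```

In the paper's setting, the attended photo features would then be projected into the shared embedding space used for retrieval.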
Keywords
Cross-modal retrieval,Remote Sensing,Sketch-based image retrieval,Attention network,Deep learning