Chrome Extension
WeChat Mini Program
Use on ChatGLM

MS2A2Net: Multiscale Self-Attention Aggregation Network for Few-Shot Aerial Imagery Segmentation

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING(2024)

Cited 0|Views29
No score
Abstract
Few-shot aerial imagery segmentation refers to the task of segmenting specific objects in scenes that have not been encountered during training with a small amount of annotated data for reference. However, most existing few-shot segmentation algorithms are primarily designed for natural images, and there is still a lack of exploration in the context of remote sensing aerial imagery. In this article, we propose a novel multiscale self attention aggregation network (MS(2)A(2)Net), dubbed MS(2)A(2)Net, to address the challenge of few-shot aerial image segmentation in terms of scarce data and network architecture. Specifically, we first incorporate the designed asymmetric momentum contrastive learning (AMCL) into the pre-training stage, to improve the representation capability of the backbone without the expensive labeled data. Then the frozen encoder is transferred to the downstream few-shot segmentation task as the feature embedding. In terms of network architecture, we design self-attention aggregation in multiscale feature fusion, to construct the dual correlation of foreground and background between support and query features at the pixel level. Besides, the coordinate attention is designed to rearrange the distribution of feature importance in both horizontal and vertical spatial order perspectives, which facilitates adaptive fusion with the multiscale features. To verify the availability of the proposed MS(2)A(2)Net, we also reconstructed two novel datasets dedicated to few-shot aerial image segmentation, called DLRSD-4(i) and iSAID-4(i). The experimental results show that our approach MS(2)A(2)Net is superior in three few-shot benchmark aerial imagery segmentation datasets, which achieves competitive segmentation performance. Extensive ablation experiments also reflect the effectiveness and scalability of the proposed components and overall network architecture.
More
Translated text
Key words
Aerial imagery segmentation,few-shot learning,self-attention mechanism,self-supervised learning (SSL)
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined