Sparse Multimodal Vision Transformer for Weakly Supervised Semantic Segmentation.
CVPR Workshops(2023)
Key words
attention head,complex computer vision tasks,Convolutional Neural Networks,fine-grained annotations,fully-supervised training,high-quality segmentation masks,human annotation load,land cover segmentation,leverages image-level labels,par,pixel-level labels,remote sensing applications,segmentation model,semantic segmentation task,sparse multimodal vision Transformer,un-pruned attention heads,Vision Transformers,weakly supervised semantic segmentation,weakly-supervised semantic segmentation,weakly-supervised vision Transformer
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined