CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding
IEEE TRANSACTIONS ON MULTIMEDIA(2024)
Key words
Grounding,Reliability,Adaptation models,Task analysis,Visualization,Data models,Annotations,Visual grounding,curriculum learning,pseudo-language label,and vision-language models
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined