VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023(2023)
Key words
vision language,parameter efficient tuning,multi-modal,few-shot recognition,prompt learning
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined