Vision-language Pre-Training Via Modal Interaction
PATTERN RECOGNITION(2024)
Key words
Cross-modal,Pre-training,Partial auxiliary,Image captioning
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined