A Survey of Vision and Language Related Multi-Modal Task
CAAI Artificial Intelligence Research(2022)
Key words
deep learning,vision and language,multi-modal generation,multi-modal analysis,multi-modal reasoning,pre-training
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined