MIMIC-IT: Multi-Modal In-Context Instruction Tuning.
CoRR(2023)
Key words
Visual Question Answering,Multimedia Learning,Vocabulary Learning,Multilingual Neural Machine Translation,Language Learning
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined