Chrome Extension
WeChat Mini Program
Use on ChatGLM

MovieChat: from Dense Token to Sparse Memory for Long Video Understanding

CVPR 2024(2024)

Cited 276|Views220
Key words
Video Understanding,Benchmark,Computational Complexity,Vision Tasks,Computational Memory,Memory Model,Memory Mechanisms,Memory Cost,Foundation Model,Long-term Memory,Short-term Memory,Quantitative Evaluation,Visual Features,Application Programming Interface,Video Frames,Memory Consolidation,Specific Moment,Video Content,Adjacent Frames,Video Features,Global Mode,Positional Encoding,Projection Layer,Extract Visual Features,Video Encoding,Merge Operation,TV Series,Multi-object Tracking,Extensive Case Studies,Short Video
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined