MCCE-REC: MLLM-driven Cross-modal Contrastive Entropy Model for Zero-shot Referring Expression Comprehension
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY(2025)
Key words
Visualization,Proposals,Feature extraction,Entropy,Circuits and systems,Detectors,Cognition,Zero-shot referring expression comprehension,multi-cues cross-modal fusion,contrastive similarity entropy
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined