CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
CVPR, Volume abs/1901.00850, 2019, Pages 4185-4194.
Referring object detection and referring image segmentation are important tasks that require joint understanding of visual information and natural language. Yet there has been evidence that current benchmark datasets suffer from bias, and current state-of-the-art models cannot be easily evaluated on their intermediate reasoning process. T...More
PPT (Upload PPT)