A Decomposable Causal View of Compositional Zero-Shot Learning

Muli Yang, Chenghao Xu,Aming Wu,Cheng Deng

IEEE TRANSACTIONS ON MULTIMEDIA（2023）

Cited 2|Views204

No score

Abstract

Composing and recognizing novel concepts that are combinations of known concepts, i.e., compositional generalization, is one of the greatest power of human intelligence. With the development of artificial intelligence, it becomes increasingly appealing to build a vision system that can generalize to unknown compositions based on restricted known knowledge, which has so far remained a great challenge to our community. In fact, machines can be easily misled by superficial correlations in the data, disregarding the causal patterns that are crucial to generalization. In this paper, we rethink compositional generalization with a causal perspective, upon the context of Compositional Zero-Shot Learning (CZSL). We develop a simple yet strong approach based on our novel Decomposable Causal view (dubbed "DECA"), by approximating the causal effect with the combination of three easy-to-learn components. Our proposed DECA(1) is evaluated on two challenging CZSL benchmarks by recognizing unknown compositions of known concepts. Despite being simple in the design, our approach achieves consistent improvements over state-of-the-art baselines, demonstrating its superiority towards the goal of compositional generalization.

Translated text

Key words

Compositional zero-shot learning,vision and language,image recognition,causality

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined