Auto-encoding and Distilling Scene Graphs for Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-1, 2020.
We propose the Scene Graph Auto-Encoder (SGAE), which incorporates a language inductive bias into the encoder-decoder image captioning framework to produce more human-like captions. Intuitively, we humans use this inductive bias to compose collocations and make contextual inferences in discourse. For example, when we see the relation "a person on a bike", ...