Auto-Encoding and Distilling Scene Graphs for Image Captioning

IEEE Transactions on Pattern Analysis and Machine Intelligence(2022)

引用 42|浏览307
暂无评分
摘要
We propose scene graph auto-encoder (SGAE) that incorporates the language inductive bias into the encoder-decoder image captioning framework for more human-like captions. Intuitively, we humans use the inductive bias to compose collocations and contextual inferences in discourse. For example, when we see the relation “a person on a bike”, it is natural to replace “on” with “ride” and infer “a pers...
更多
查看译文
关键词
Visualization,Decoding,Training,Roads,Pipelines,Dictionaries,Semantics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要