A Unified Generation-Retrieval Framework for Image Captioning

Chunpu Xu, Wei Zhao, Min Yang, Xiang Ao, Wangrong Cheng, Jinwen Tian

Proceedings of the 28th ACM International Conference on Information and Knowledge Management (2019)

Abstract

Recent image captioning approaches are typically either generation-based or retrieval-based. Both kinds of method have their advantages but are limited by their respective disadvantages. In this paper, we propose a Unified Generation-Retrieval framework for Image Captioning (UGRIC) using adversarial learning. Unlike previous methods, the proposed UGRIC model leverages the informative content of the N-best response candidates provided by the retrieval-based model to enhance the generation-based method. In addition, to further improve the informativeness of the generated caption, we employ a copying mechanism that chooses words from the retrieved candidate captions and places them at appropriate positions in the output sequence. Experiments on the MSCOCO dataset demonstrate the effectiveness of the UGRIC model across various evaluation metrics. Code and data are available at: http://tinyurl.com/y6z2x6ho.
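
To make the idea of copying words from retrieved captions concrete, the following is a minimal sketch of a generic copy mechanism in the pointer-generator style. It is an illustration under stated assumptions, not the UGRIC implementation: the abstract does not specify how copy and generation scores are combined, so the mixing weight `p_gen`, the function `mix_generate_and_copy`, and all example scores are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix_generate_and_copy(vocab_logits, vocab, copy_scores, candidate_tokens, p_gen):
    """Blend a generation distribution with a copy distribution.

    Assumption (not from the paper): the final word distribution is
    p_gen * P_generate + (1 - p_gen) * P_copy, where P_copy is defined
    over tokens taken from the retrieved candidate captions.
    """
    gen_probs = softmax(vocab_logits)
    copy_probs = softmax(copy_scores)
    # Start from the scaled generation distribution over the fixed vocabulary.
    final = {w: p_gen * p for w, p in zip(vocab, gen_probs)}
    # Add copy mass; tokens outside the vocabulary become reachable here.
    for tok, p in zip(candidate_tokens, copy_probs):
        final[tok] = final.get(tok, 0.0) + (1.0 - p_gen) * p
    return final


# Hypothetical decoding step: "frisbee" appears only in a retrieved caption.
vocab = ["a", "dog", "on", "grass"]
dist = mix_generate_and_copy(
    vocab_logits=[1.0, 0.5, 0.2, 0.1],
    vocab=vocab,
    copy_scores=[2.0, 0.3, 1.0],
    candidate_tokens=["frisbee", "dog", "grass"],
    p_gen=0.7,
)
print(sorted(dist.items(), key=lambda kv: -kv[1]))
```

In this sketch a word such as "frisbee", which is absent from the generator's vocabulary, still receives probability mass because it occurs in a retrieved candidate, which is the kind of informativeness gain the abstract attributes to copying.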

Keywords

adversarial learning, copying mechanism, image captioning
