Image Caption Generation with Part of Speech Guidance
Pattern Recognition Letters(2019)
摘要
•A novel guiding mechanism based on Part of Speech (PoS) tags is proposed for guiding image caption generation process.•The framework achieves competitive performance over state-of-the-art models on Flickr30K and MS COCO datasets.•Results including some demonstrative examples are provided to show the effectiveness of the novel guiding mechanism.
更多查看译文
关键词
Image caption generation,Part-of-speech tags,Long Short-Term Memory,Visual attributes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络