Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge
IEEE Trans. Pattern Anal. Mach. Intell., Volume abs/1609.06647, Issue 4, 2017, Pages 652-663.
Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing. In this paper, we present a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be use...More
Full Text (Upload PDF)
PPT (Upload PPT)