Delving Deeper into the Decoder for Video Captioning
ECAI, pp. 1079-1086, 2020.
Video captioning is an advanced multi-modal task which aims to describe a video clip using a natural language sentence. The encoder-decoder framework is the most popular paradigm for this task in recent years. However, there still exist some non-negligible problems in the decoder of a video captioning model. We make a thorough investiga...More
PPT (Upload PPT)