Delving Deeper into the Decoder for Video Captioning

Chen Haoran
Chen Haoran
Li Jianmin
Li Jianmin

ECAI, pp. 1079-1086, 2020.

Cited by: 3|Bibtex|Views10|DOI:https://doi.org/10.3233/FAIA200204
EI
Other Links: arxiv.org|dblp.uni-trier.de|academic.microsoft.com

Abstract:

Video captioning is an advanced multi-modal task which aims to describe a video clip using a natural language sentence. The encoder-decoder framework is the most popular paradigm for this task in recent years. However, there still exist some non-negligible problems in the decoder of a video captioning model. We make a thorough investiga...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments