Hierarchical Memory Decoder for Visual Narrating
IEEE Transactions on Circuits and Systems for Video Technology(2021)
摘要
Visual narrating focuses on generating semantic descriptions to summarize visual content of images or videos, e.g., visual captioning and visual storytelling. The challenge mainly lies in how to design a decoder to generate accurate descriptions matching visual content. Recent advances often employ a recurrent neural network (RNN), e.g., Long-Short Term Memory (LSTM), as the decoder. However, RNN ...
更多查看译文
关键词
Decoding,Visualization,Videos,Task analysis,Computer architecture,Electronic mail,Semantics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络