Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network

Cited by: 1|Views44
Weibo:
We present Global Enhanced Transformer for image captioning

Abstract:

Transformer-based architectures have shown great success in image captioning, where object regions are encoded and then attended into the vectorial representations to guide the caption decoding. However, such vectorial representations only contain region-level information without considering the global information reflecting the entire ...More

Code:

Data:

0
Full Text
Bibtex
Weibo
Your rating :
0

 

Tags
Comments