Hidden State Guidance: Improving Image Captioning Using an Image Conditioned Autoencoder
ViGIL@NeurIPS, 2019.
Abstract:
Most RNN-based image captioning models receive supervision on the output words to mimic human captions. Therefore, the hidden states can only receive noisy gradient signals via layers of back-propagation through time, leading to less accurate generated captions. Consequently, we propose a novel framework, Hidden State Guidance (HSG), th...More
Code:
Data:
Full Text
Tags
Comments