Emotional Dialogue Generation using Image-Grounded Language Models
Conference on Human Factors in Computing Systems(2018)
摘要
ABSTRACTComputer-based conversational agents are becoming ubiquitous. However, for these systems to be engaging and valuable to the user, they must be able to express emotion, in addition to providing informative responses. Humans rely on much more than language during conversations; visual information is key to providing context. We present the first example of an image-grounded conversational agent using visual sentiment, facial expression and scene features. We show that key qualities of the generated dialogue can be manipulated by the features used for training the agent. We evaluate our model on a large and very challenging real-world dataset of conversations from social media (Twitter). The image-grounding leads to significantly more informative, emotional and specific responses, and the exact qualities can be tuned depending on the image features used. Furthermore, our model improves the objective quality of dialogue responses when evaluated on standard natural language metrics.
更多查看译文
关键词
Dialogue, conversation, emotion, computer vision, conversational agents
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络