Learning To Make Better Mistakes: Semantics-Aware Visual Food Recognition

MM(2016)

引用 85|浏览147
暂无评分
摘要
We propose a visual food recognition framework that integrates the inherent semantic relationships among fine-grained classes. Our method learns semantics-aware features by formulating a multi-task loss function on top of a convolutional neural network (CNN) architecture. It then re fines the CNN predictions using a random walk based smoothing procedure, which further exploits the rich semantic information.We evaluate our algorithm on a large "food-in-the-wild" benchmark [3], as well as a challenging dataset of restaurant food dishes with very few training images. The proposed method achieves higher classification accuracy than a baseline which directly fine-tunes a deep learning network on the target dataset. Furthermore, we analyze the consistency of the learned model with the inherent semantic relationships among food categories. Results show that the proposed approach provides more semantically meaningful results than the baseline method, even in cases of mispredictions.
更多
查看译文
关键词
Food Recognition,Multi-task Learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要