Instruction-guided object detection

Proceedings of the ACM Turing Celebration Conference - China(2019)

引用 0|浏览56
暂无评分
摘要
This paper aims at instruction-guided object detection, i.e., predicting the objects associated with the implementation of a specific instruction for intelligent robots. It is not practical to solve this problem by picking out the instruction-related objects from the detection results of a general object detector due to massive annotation cost and low adaptability of training data. To address these challenges, we introduce a flexible dataset that can well adapt to the variation of the instruction set and only annotates instruction-related object samples. We then propose to amend the current detection paradigm by incorporating semantic instruction description effectively. Specifically, the relationship between an instruction and related objects is modeled by the and-or graph and is further fused into a unified neural network for solving the object detection problem constrained by instructions. Our algorithm encodes the and-or graph representation to attend related objects sensitive local features and then infer instruction-related object location. Extensive evaluation on the newly constructed dataset verifies the effectiveness of our approach.
更多
查看译文
关键词
and-or graph, attention modeling, instruction, object detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要