Belief Tree Search for Active Object Recognition

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)（2017）

引用 10|浏览19

暂无评分

摘要

Active Object Recognition (AOR) has been approached as an unsupervised learning problem, in which optimal trajectories for object inspection are not known and are to be discovered by reducing label uncertainty measures or training with reinforcement learning. Such approaches have no guarantees of the quality of their solution. In this paper, we treat AOR as a Partially Observable Markov Decision Process (POMDP) and find near-optimal policies on training data using Belief Tree Search (BTS) on the corresponding belief Markov Decision Process (MDP). AOR then reduces to the problem of knowledge transfer from near-optimal policies on training set to the test set. We train a Long Short Term Memory (LSTM) network to predict the best next action on the training set rollouts. We sho that the proposed AOR method generalizes well to novel views of familiar objects and also to novel objects. We compare this supervised scheme against guided policy search, and find that the LSTM network reaches higher recognition accuracy compared to the guided policy method. We further look into optimizing the observation function to increase the total collected reward of optimal policy. In AOR, the observation function is known only approximately. We propose a gradient-based method update to this approximate observation function to increase the total reward of any policy. We show that by optimizing the observation function and retraining the supervised LSTM network, the AOR performance on the test set improves significantly.

查看译文

关键词

observation function,Belief tree search,active object recognition,unsupervised learning problem,optimal trajectories,object inspection,label uncertainty,reinforcement learning,Partially Observable Markov Decision Process,near-optimal values,Belief Tree Search,Long Short Term Memory network,higher recognition accuracy,active recognition,LSTM,Markov Decision Process,BTS,Long Short Term Memory,BTS

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要