DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2017)

引用 1052|浏览139
暂无评分
摘要
We introduce a Deep Stochastic IOC RNN Encoderdecoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, future may vary), 2) foreseeing the potential future outcomes and make a strategic prediction based on that, and 3) reasoning not only from the past motion history, but also from the scene context as well as the interactions among the agents. DESIRE achieves these in a single end-to-end trainable neural network model, while being computationally efficient. The model first obtains a diverse set of hypothetical future prediction samples employing a conditional variational autoencoder, which are ranked and refined by the following RNN scoring-regression module. Samples are scored by accounting for accumulated future rewards, which enables better long-term strategic decisions similar to IOC frameworks. An RNN scene context fusion module jointly captures past motion histories, the semantic scene context and interactions among multiple agents. A feedback mechanism iterates over the ranking and refinement to further boost the prediction accuracy. We evaluate our model on two publicly available datasets: KITTI and Stanford Drone Dataset. Our experiments show that the proposed model significantly improves the prediction accuracy compared to other baseline methods.
更多
查看译文
关键词
DESIRE,distant future prediction,dynamic scenes,multiple interacting agents,single end-to-end trainable neural network model,hypothetical future prediction samples,conditional variational auto-encoder,RNN scoring-regression module,IOC frameworks,RNN scene context fusion module,semantic scene context,multiple agents,prediction accuracy,deep stochastic IOC RNN encoder-decoder framework,KITTI dataset,Stanford Drone dataset
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要