Planning for large-scale multiagent problems via hierarchical decomposition with applications to UAV health management

ACC(2014)

引用 20|浏览34
暂无评分
摘要
This paper introduces a novel hierarchical decomposition approach for solving Multiagent Markov Decision Processes (MMDPs) by exploiting coupling relationships in the reward function. MMDP is a natural framework for solving stochastic multi-stage multiagent decision-making problems, such as optimizing mission performance of Unmanned Aerial Vehicles (UAVs) with stochastic health dynamics. However, computing the optimal solutions is often intractable because the state-action spaces scale exponentially with the number of agents. Approximate solution techniques do exist, but they typically rely on extensive domain knowledge. This paper presents the Hierarchically Decomposed MMDP (HD-MMDP) algorithm, which autonomously identifies different degrees of coupling in the reward function and decomposes the MMDP into a hierarchy of smaller MDPs that can be solved separately. Solutions to the smaller MDPs are embedded in an autonomously constructed tree structure to generate an approximate solution to the original problem. Simulation results show HD-MMDP obtains more cumulative reward than that of the existing algorithm for a ten-agent Persistent Search and Track (PST) mission, which is a cooperative multi-UAV mission with more than 1019 states, stochastic fuel consumption model, and health progression model.
更多
查看译文
关键词
fault-tolerant systems,stochastic systems,robot dynamics,mmdp,uav health management,rescue robots,cooperative multi uav mission,trees (mathematics),approximation theory,decision making,stochastic multistage multiagent decision-making problems,large-scale multiagent problems,multi-robot systems,unmanned aerial vehicles,stochastic health dynamics,coupling relationships,reward function,autonomous aerial vehicles,health progression model,path planning,tree structure,condition monitoring,multiagent markov decision processes,stochastic fuel consumption model,markov processes,ten-agent persistent search-and-track mission,hierarchically decomposed mmdp algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要