MA-TREX: Mutli-agent Trajectory-Ranked Reward Extrapolation via Inverse Reinforcement Learning

Sili Huang
Sili Huang
Haiyin Piao
Haiyin Piao
Zhixiao Sun
Zhixiao Sun
Yi Chang
Yi Chang

knowledge science, engineering and management, pp. 3-14, 2020.

被引用0|浏览7

摘要

Trajectory-ranked reward extrapolation (T-REX) provides a general framework to infer users’ intentions from sub-optimal demonstrations. However, it becomes inflexible when encountering multi-agent scenarios, due to its high complexity caused by rational behaviors, e.g., cooperation and communication. In this paper, we propose a novel Mult...更多

代码

数据

ZH
24小时获取PDF
引用
您的评分 :
0

 

标签
评论