Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration

Ritesh Noothigattu
Djallel Bouneffouf
Rachita Chandra
Piyush Madan

arXiv: Learning, Volume abs/1809.08343, 2018.

Abstract:

Autonomous cyber-physical agents and systems play an increasingly large role in our lives. To ensure that agents behave in ways aligned with the values of the societies in which they operate, we must develop techniques that allow these agents to not only maximize their reward in an environment, but also to learn and follow the implicit co...
