Eliciting additive reward functions for Markov decision processes
IJCAI, pp. 2159-2164, 2011.
Specifying the reward function of a Markov decision process (MDP) can be demanding, requiring human assessment of the precise quality of, and tradeoffs among, various states and actions. However, reward functions often possess considerable structure which can be leveraged to streamline their specification. We develop new, decision-theoret...More
PPT (Upload PPT)