Learning Behaviors with Uncertain Human Feedback

He Xu
He Xu
Chen Haipeng
Chen Haipeng

UAI, pp. 131-140, 2020.

Cited by: 0|Bibtex|Views33
EI
Other Links: arxiv.org|dblp.uni-trier.de|academic.microsoft.com

Abstract:

Human feedback is widely used to train agents in many domains. However, previous works rarely consider the uncertainty when humans provide feedback, especially in cases that the optimal actions are not obvious to the trainers. For example, the reward of a sub-optimal action can be stochastic and sometimes exceeds that of the optimal act...More

Code:

Data:

Your rating :
0

 

Tags
Comments