The Predictron: End-To-End Learning and Planning.
international conference on machine learning, (2017)
One of the key challenges of artificial intelligence is to learn models that are effective in the context of planning. In this document we introduce the predictron architecture. predictron consists of a fully abstract model, represented by a Markov reward process, that can be rolled forward multiple imagined planning steps. Each forward ...更多
下载 PDF 全文