Evaluation of Proximal Policy Optimization with Extensions in Virtual Environments of Various Complexity

International Conference Radioelektronika（2022）

引用 0|浏览0

暂无评分

摘要

This paper presents an evaluation of the Proximal Policy Optimization deep reinforcement learning algorithm alone and with several extensions. The extensions used are imitation learning and intrinsic curiosity module. The evaluation takes place in a custom-made racing game, where the goal of an agent, a car, is to complete a racing circuit. The game is implemented in the Unity game engine and the machine learning algorithms are provided by the ML-Agents toolkit. The game provides three levels with racing circuits of different complexity. The results described focus on the hyperparameter search, training progress for all the combinations of the algorithm and the extensions and evaluation in the inference mode.

查看译文

关键词

reinforcement learning,Proximal Policy Optimization,evaluation,virtual environment,Unity,ML-Agents

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要