Inference-based Reinforcement Learning and Its Application to Dynamic Resource Allocation

2022 30th European Signal Processing Conference (EUSIPCO)（2022）

引用 0|浏览2

暂无评分

摘要

Reinforcement learning (RL) is a powerful machine learning technique to learn optimal actions in a control system setup. An important drawback of RL algorithms is the need for balancing exploitation vs exploration. Exploration corresponds to taking randomized actions with the aim to learn from it and make better decisions in the future. However, these exploratory actions result in poor performance, and current RL algorithms have a slow convergence as one can only learn from a single action outcome per iteration. We propose a novel concept of Inference-based RL that is applicable to a specific class of RL problems, and that allows to eliminate the performance impact caused by traditional exploration strategies, thereby making RL performance more consistent and greatly improving the convergence speed. The specific RL problem class is a problem class in which the observation of the outcome of one action can be used to infer the outcome of other actions, without the need to actually perform them. We apply this novel concept to the use case of dynamic resource allocation, and show that the proposed algorithm outperforms existing RL algorithms, yielding a drastic increase in both convergence speed and performance.

查看译文

关键词

optimal actions,control system setup,important drawback,exploration corresponds,randomized actions,exploratory actions,single action outcome,Inference-based RL,RL problems,performance impact,traditional exploration strategies,RL performance,convergence speed,dynamic resource allocation,Inference-based reinforcement learning,machine learning technique,RL problem class

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要