Improving Decision-Making Policy for Cognitive Radio Applications Using Reinforcement Learning

Rajat Singh, Jayant Kumar Rai, Pinku Ranjan, Rakesh Chowdhury

2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) (2024)

Abstract

Motivated by cognitive radio, interest in stochastic multi-player multi-armed bandits has grown recently. In the cognitive radio setting, autonomous players concurrently pull arms (channels), each optimizing its own reward. The problem becomes harder when collisions can occur: if multiple players select the same arm simultaneously, they collectively receive zero reward. Our work centers on the Multiplayer Multi-Armed Bandit (MMAB) problem, in which M decision makers collaborate to maximize cumulative reward in a cognitive radio application. Collisions prompt players to adapt. We introduce RobustMMAB, a decentralized algorithm that aims to achieve regret comparable to that of an optimal centralized algorithm while increasing resilience against selfish nodes.
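
The collision model described in the abstract can be illustrated with a minimal sketch. This is not the RobustMMAB algorithm, whose details are not given here. It only simulates the setting under stated assumptions: Bernoulli channel rewards, every player on a contested arm receiving zero, and a uniformly random baseline policy. All function names and parameters (`play_round`, `centralized_optimum`, `simulate_random_policy`, `means`) are hypothetical and chosen for illustration.

```python
import random


def play_round(choices, means, rng):
    """Per-player rewards for one round (assumed Bernoulli channels)."""
    counts = {}
    for arm in choices:
        counts[arm] = counts.get(arm, 0) + 1
    rewards = []
    for arm in choices:
        if counts[arm] > 1:
            # Collision: every player on a shared arm receives zero.
            rewards.append(0)
        else:
            rewards.append(1 if rng.random() < means[arm] else 0)
    return rewards


def centralized_optimum(means, num_players):
    """Expected per-round reward when the M best arms are used collision-free."""
    return sum(sorted(means, reverse=True)[:num_players])


def simulate_random_policy(means, num_players, horizon, seed=0):
    """Baseline (not RobustMMAB): each player picks an arm uniformly at random."""
    rng = random.Random(seed)
    total = 0
    for _ in range(horizon):
        choices = [rng.randrange(len(means)) for _ in range(num_players)]
        total += sum(play_round(choices, means, rng))
    # Empirical regret relative to the expected centralized benchmark.
    regret = horizon * centralized_optimum(means, num_players) - total
    return total, regret
```

Calling `simulate_random_policy([0.9, 0.5, 0.8], num_players=2, horizon=1000)` returns the collected reward and the gap to the centralized benchmark; a decentralized algorithm of the kind the abstract describes would aim to keep that gap small.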

Keywords

Reinforcement Learning, Cognitive Radio, Selfish Robustness, Multi-player Multi-arm Bandit
