Improving Decision-Making Policy for Cognitive Radio Applications Using Reinforcement Learning

2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI)(2024)

引用 0|浏览0
暂无评分
摘要
Motivated by cognitive radios, there has been a recent increase of interest in stochastic multi-player multi-armed bandits. In this context of cognitive radio’s, autonomous players concurrently engage in arm or channel pulls, individually opti-mizing rewards. Complexity amplifies with potential collisions, wherein multiple players simultaneously select a common arm, resulting in zero collective reward. Our work centers on the Multiplayer Multi-Armed Bandit (MMAB) problem, involving M decision makers collaborating to maximize cumulative reward in cognitive radio application. Collision prompts players to adapt. We introduce RobustMMAB, a decentralized algorithm aiming to achieve regret akin to an optimal centralized algorithm while increasing resilience against selfish nodes.
更多
查看译文
关键词
Reinforcement Learning,Cognitive Radio,Selfish Robustness,Multi-player Multi-arm Bandit
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要