Fair Resource Reusing for D2D Communication Based on Reinforcement Learning.

SGIoT(2020)

Abstract
Device-to-device (D2D) communications can improve the overall performance of fifth-generation (5G) wireless networks, offering lower latency, higher data rates, and greater system capability. System capability can be improved further by reusing resources between D2D user equipment (DUE) and cellular user equipment (CUE) without causing harmful interference to the CUEs. A D2D resource allocation method is expected to allow one CUE to be allocated a variable number of resource blocks (RBs), and to allow the RBs to be reused by more than one CUE. In this study, the Multi-Player Multi-Armed Bandit (MPMAB) reinforcement learning method is employed to model this problem by establishing a preference matrix. A fair resource allocation method is then proposed to achieve fairness, prevent resource waste, and alleviate starvation. The method also achieves better throughput when the number of D2D pairs is not too large.
Key words
Device-to-Device (D2D), Resource allocation, Reinforcement learning, Multi-Player Multi-Armed Bandit (MPMAB), Dynamic resource allocation
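
The abstract names the MPMAB formulation and a preference matrix but gives no algorithmic detail, so the following is only a minimal illustrative sketch and not the paper's method. It assumes an epsilon-greedy learner per D2D pair, Bernoulli rewards, and a simplified collision rule in which two pairs choosing the same RB in a round both receive zero reward (the paper itself allows RB reuse). The names `run_mpmab`, `jain_index`, `true_mean`, and all parameter values are hypothetical. Jain's index is used here as one common way to quantify fairness; the abstract does not say which fairness measure the authors use.

```python
import random


def run_mpmab(num_pairs=3, num_rbs=4, rounds=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy multi-player bandit: each D2D pair picks one RB per round."""
    rng = random.Random(seed)
    # Hypothetical ground truth: success probability of pair p on RB r.
    # The learners never read this table directly.
    true_mean = [[rng.random() for _ in range(num_rbs)] for _ in range(num_pairs)]
    # Preference matrix: running average reward observed by pair p on RB r.
    pref = [[0.0] * num_rbs for _ in range(num_pairs)]
    pulls = [[0] * num_rbs for _ in range(num_pairs)]
    total_reward = [0.0] * num_pairs

    for _ in range(rounds):
        choices = []
        for p in range(num_pairs):
            if rng.random() < epsilon:
                # Explore: try a random RB.
                choices.append(rng.randrange(num_rbs))
            else:
                # Exploit: pick the RB with the highest current preference.
                choices.append(max(range(num_rbs), key=lambda r: pref[p][r]))
        for p, r in enumerate(choices):
            # Assumed collision rule: a shared RB yields zero reward this round.
            collided = choices.count(r) > 1
            success = (not collided) and rng.random() < true_mean[p][r]
            reward = 1.0 if success else 0.0
            pulls[p][r] += 1
            # Incremental mean update of the preference entry.
            pref[p][r] += (reward - pref[p][r]) / pulls[p][r]
            total_reward[p] += reward
    return pref, pulls, total_reward


def jain_index(values):
    """Jain's fairness index: 1.0 when all values are equal, 1/n when one user gets everything."""
    square_sum = sum(v * v for v in values)
    if square_sum == 0:
        return 0.0  # convention chosen here for the all-zero case
    return sum(values) ** 2 / (len(values) * square_sum)


if __name__ == "__main__":
    pref, pulls, total_reward = run_mpmab()
    for p, row in enumerate(pref):
        print(f"pair {p}: preferences {[round(v, 2) for v in row]}")
    print("fairness (Jain):", round(jain_index(total_reward), 3))
```

Each row of `pref` is one D2D pair's learned ranking of the RBs. A fairness-aware allocator would use these rows together with a measure such as `jain_index` to avoid starving pairs whose preferred RBs are always contested.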