谷歌浏览器插件
订阅小程序
在清言上使用

Entropy Normalization SAC-Based Task Offloading for UAV-Assisted Mobile Edge Computing

Tan Deng, Yanping Wang, Jin Li,Ronghui Cao, Yongtong Gu, Jinming Hu,Xiaoyong Tang,Mingfeng Huang,Wenzheng Liu, Shixue Li

IEEE Internet Things J(2024)

引用 0|浏览6
暂无评分
摘要
With the advantages of maneuverability and low cost, Unmanned Aerial Vehicles (UAVs) are widely deployed in mobile edge computing as micro servers to provide computing service. However, tasks usually require a large amount of energy and have strict time constraints, while the battery energy and endurance of UAVs are limited. Therefore, energy consumption and delay have become key issues in such architectures. To address this issue, an Entropy Normalized Soft Actor-Critic (ENSAC) computation offloading algorithm is proposed in this paper, aiming to minimize the weighted sum of task offloading delay and energy consumption. In ENSAC, we formulate the task offloading problem as a Markov Decision Process (MDP). Considering the non-convexity, high-dimensional state space, and continuous action space of this problem, the ENSAC algorithm fully combines deviation strategy and maximum entropy reinforcement learning, and designs a system utility function under entropy normalization as a reward function, thus ensuring fairness in weighted energy consumption and delay. What’s more, ENSAC algorithm also considers UAV trajectory planning, task offloading ratio, and power allocation in the UAV-assisted MEC system. Therefore, compared with previous methods, ENSAC algorithm has stronger stability, better exploration performance, and can handle more complex environments and larger action space. Finally, extensive experiments demonstrate that, in both energy-saving and delay-sensitive scenarios, the ENSAC algorithm can quickly converge to the optimal solution while maintaining stability. Compared with four benchmark algorithms, it reduces the total system cost by 52.73%.
更多
查看译文
关键词
Mobile edge computing,Unmanned aerial vehicle,Computation offloading,Deep reinforcement learning,QoS
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要