谷歌浏览器插件
订阅小程序
在清言上使用

Dynamic Memory Provisioning on Disaggregated HPC Systems.

SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis(2023)

引用 0|浏览6
暂无评分
摘要
Disaggregated memory is under investigation as a way to break the rigid boundaries between node memory hierarchies in order to provide memory as a system-wide pooled resource. The resource manager allocates the system’s disaggregated memory to jobs, based on the memory requirements defined by the user at job submission time. It is hard for the user to know the job’s precise peak memory footprint, and prior work has shown that users have an incentive to overestimate their needs. This overestimation leads to a significant overallocation of memory, and the majority of the physical memory in the system is wasted. This paper presents a way to reclaim much of this overallocated memory. We extend the Slurm job scheduler to dynamically reallocate memory, according to the job’s current memory footprint. We enhance an existing Slurm simulator to model this situation and combine publicly available traces to model an HPC system on up to 1490 nodes. Our results show that the dynamic memory provisioning approach increases the throughput per dollar by up to 38%, compared to a system with static allocation of disaggregated memory.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要