Pream: Enhancing HPC Storage System Performance with Pre-Allocated Metadata Management Mechanism

2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS)(2019)

引用 2|浏览16
暂无评分
摘要
With the convergence of high performance computing (HPC) and big data, processing a large volume of scientific data on HPC systems is getting increased attentions. However, supporting these data-intensive workloads on HPC systems that are geared for compute-intensive workloads presents a new challenge in data management. Typical HPC systems consist of a large collection of compute nodes and use parallel file systems (PFSs) for persistent data storage. Although PFSs provide concurrent I/O bandwidth and perform well for large sequential write/read requests, its performance is bottlenecked by expensive metadata operations. Moreover, data-intensive applications with bursty I/O patterns or generate a large number of temporary files exacerbate the shortcomings. In this paper, we propose Pream, a light-weight metadata management framework that aim to address these challenges. Pream targets scenarios of supporting data-intensive workloads that generates a huge number of temporary files on diskless compute nodes. Pream pre-allocates file metadata from the metadata server, and manages these metadata locally to accelerate metadata operations. While newly created temporary files keep residing in PFSs, open/create requests of these temporary files can be handled by Pream locally without connecting with PFSs. Our evaluation demonstrates that Pream can outperform Lustre in many workloads and reduce latency of metadata operation efficiently.
更多
查看译文
关键词
high performance computing,big data,parallel file system,pre-allocate,metadata management
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要