High Performance Multi-Node File Copies and Checksums for Clustered File Systems.

Paul Z. Kolano, Robert Ciotti

LISA'10: Proceedings of the 24th international conference on Large installation system administration(2010)

引用 4|浏览2
暂无评分
摘要
Mcp and msum are drop-in replacements for the standard cp and md5sum programs that utilize multiple types of parallelism and other optimizations to achieve maximum copy and checksum performance on clustered file systems. Multi-threading is used to ensure that nodes are kept as busy as possible. Read/write parallelism allows individual operations of a single copy to be overlapped using asynchronous I/O. Multi-node cooperation allows different nodes to take part in the same copy/checksum. Split file processing allows multiple threads to operate concurrently on the same file. Finally, hash trees allow inherently serial checksums to be performed in parallel. This paper presents the design of mcp and msum and detailed performance numbers for each implemented optimization. It will be shown how mcp improves cp performance over 27×, msum improves md5sum performance almost 19×, and the combination of mcp and msum improves verified copies via cp and md5sum by almost 22×.
更多
查看译文
关键词
checksum performance,cp performance,detailed performance number,md5sum performance,file system,maximum copy,md5sum program,single copy,split file processing,standard cp,high performance multi-node file
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要