BPR: An Erasure Coding Batch Parallel Repair Approach in Distributed Storage Systems

Ying Song, Wenxuan Zhao,Bo Wang

IEEE Access(2023)

引用 1|浏览7
暂无评分
摘要
Today, Erasure Coding is one of the most significant techniques widely used in distributed systems because it can improve reliability for large amounts of data with low storage overhead. However, when the distributed system encounters a large number of data loss in stripes and requires batch-stripes data recovery, current data recovery methods either repeat the single-stripe recovery method or only optimize partial stripe recovery when recovering large-scale stripes, which incurs heavy upload and download repair traffics and imbalanced load, affecting the efficiency of fault recovery and wasting additional resources. In this paper, we propose BPR, an Erasure Coding batch parallel repair approach for distributed storage systems. BPR reduces cross-rack network transfer time and increases recovery throughput by classifying the stripes and recovering the data of stripes in batches through the forward and reverse parallel data recovery. The experiment results show that for large-scale stripes recovery, BPR reduces the cross-rack network transfer time by up to 10% and increases the recovery throughput by up to 8% compared with the rPDL in some scenarios.
更多
查看译文
关键词
Distributed processing,Business process re-engineering,Traffic congestion,Encoding,Full-duplex system,Decoding,Bandwidth,Storage management,Distributed storage system,erasure coding,data recovery
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要