Information (Jun 2018)

Rack Aware Data Placement for Network Consumption in Erasure-Coded Clustered Storage Systems

  • Bilin Shao,
  • Dan Song,
  • Genqing Bian,
  • Yu Zhao

DOI
https://doi.org/10.3390/info9070150
Journal volume & issue
Vol. 9, no. 7
p. 150

Abstract

Read online

The amount of encoded data replication in an erasure-coded clustered storage system has a great impact on the bandwidth consumption and network latency, mostly during data reconstruction. Aimed at the reasons that lead to the excess data transmission between racks, a rack aware data block placement method is proposed. In order to ensure rack-level fault tolerance and reduce the frequency and amount of the cross-rack data transmission during data reconstruction, the method deploys partial data block concentration to store the data blocks of a file in fewer racks. Theoretical analysis and simulation results show that our proposed strategy greatly reduces the frequency and data volume of the cross-rack transmission during data reconstruction. At the same time, it has better performance than the typical random distribution method in terms of network usage and data reconstruction efficiency.

Keywords