On the Complexity and Performance of the Information Dispersal Algorithm

Ricardo Marcelin-Jimenez; Jorge Luis Ramirez-Ortiz; Enrique Rodriguez De La Colina; Michael Pascoe-Chalke; Jose Luis Gonzalez-Compean

doi:10.1109/ACCESS.2020.3020501

IEEE Access (Jan 2020)

Affiliations

Ricardo Marcelin-Jimenez: Department of Electrical Engineering, Universidad Autónoma Metropolitana, CDMX, Mexico
Jorge Luis Ramirez-Ortiz: Department of Electrical Engineering, Universidad Autónoma Metropolitana, CDMX, Mexico
Enrique Rodriguez De La Colina: Department of Electrical Engineering, Universidad Autónoma Metropolitana, CDMX, Mexico
Michael Pascoe-Chalke: Department of Electrical Engineering, Universidad Autónoma Metropolitana, CDMX, Mexico
Jose Luis Gonzalez-Compean: CINVESTAV, Tamaulipas, Mexico

DOI: https://doi.org/10.1109/ACCESS.2020.3020501
Journal volume & issue: Vol. 8
pp. 159284 – 159290

Abstract

The Information Dispersal Algorithm (IDA) has become a key component in several fault-tolerant massive storage systems. From a theoretical point of view, it is a linear transformation over a finite field, applied to the vectors that make up a given file. The direct transformation adds redundancy, splitting the initial file into a new set of files called dispersals. The inverse transformation recovers the original file from a subset of the dispersals. This research demonstrates the impact of input and output (I/O) operations on the direct and inverse transformations. Different alternatives for controlling the exchange of elements between RAM and disk, the key operation for building a vector in memory and storing its entries in a file, were evaluated. First, the impact of the working finite field was tested; second, the impact of using a buffer for the exchange between RAM and the hard disk; and finally, several instances of the algorithm were deployed simultaneously to evaluate the impact of parallelism. The results demonstrate that the combination of these factors may have an important effect on the speed of both the direct and inverse procedures.
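
The direct and inverse transformations described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the prime field GF(257), the Vandermonde dispersal matrix, and the function names disperse and recover are assumptions chosen for brevity, and the whole file is held in memory, so the I/O and buffering effects studied in the paper are not modeled. Note that over GF(257) a dispersal symbol can take the value 256, so dispersals are kept as lists of integers rather than bytes.

```python
# Minimal IDA sketch over the prime field GF(257).
# The field, the Vandermonde matrix, and all names here are illustrative assumptions.
P = 257  # smallest prime above 255, so every byte value is a field element


def vandermonde_row(x, m):
    """Row [1, x, x^2, ..., x^(m-1)] reduced mod P."""
    return [pow(x, j, P) for j in range(m)]


def disperse(data, n, m):
    """Direct transformation: split `data` into n dispersals, any m of which suffice."""
    padded = list(data) + [0] * (-len(data) % m)  # zero-pad to a multiple of m
    vectors = [padded[i:i + m] for i in range(0, len(padded), m)]
    dispersals = {}
    for x in range(1, n + 1):  # distinct nonzero evaluation points
        row = vandermonde_row(x, m)
        dispersals[x] = [sum(a * b for a, b in zip(row, v)) % P for v in vectors]
    return dispersals


def invert_matrix(rows):
    """Gauss-Jordan inversion of a square matrix mod P."""
    m = len(rows)
    aug = [list(r) + [int(i == j) for j in range(m)] for i, r in enumerate(rows)]
    for col in range(m):
        pivot = next(r for r in range(col, m) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        inv = pow(aug[col][col], P - 2, P)  # inverse via Fermat's little theorem
        aug[col] = [(v * inv) % P for v in aug[col]]
        for r in range(m):
            if r != col and aug[r][col]:
                f = aug[r][col]
                aug[r] = [(a - f * b) % P for a, b in zip(aug[r], aug[col])]
    return [row[m:] for row in aug]


def recover(subset, m, length):
    """Inverse transformation: rebuild the bytes from any m dispersals."""
    points = sorted(subset)[:m]
    inverse = invert_matrix([vandermonde_row(x, m) for x in points])
    out = []
    for k in range(len(subset[points[0]])):
        column = [subset[x][k] for x in points]
        out.extend(sum(a * b for a, b in zip(row, column)) % P for row in inverse)
    return bytes(out[:length])  # drop the zero padding
```

As a usage example, disperse(b"some file contents", 5, 3) yields five dispersals keyed by evaluation point, and passing any three of them to recover, together with the original length, returns the input. Any m rows of a Vandermonde matrix built from distinct points are linearly independent, which is what makes every m-subset sufficient.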

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
