Compressing Data Cube in Parallel OLAP Systems

Frank Dehne; Todd Eavis; Boyong Liang

doi:10.2481/dsj.6.S184

Data Science Journal (Mar 2007)

Affiliations

Frank Dehne: School of Computer Science, Carleton University, 1125 Colonel By Drive, Ottawa, Canada K1S 5B6
Todd Eavis: Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd. West, Montreal, Canada H3G 1M8
Boyong Liang: School of Computer Science, Carleton University, 1125 Colonel By Drive, Ottawa, Canada K1S 5B6

Journal volume & issue: Vol. 6

Abstract

This paper proposes an efficient algorithm for compressing data cubes during parallel data cube generation. The low-overhead mechanism compresses data block by block and record by record using tuple difference coding, thereby maximizing the compression ratio and minimizing the decompression penalty at run time. Experimental results show that the typical compression ratio is about 30:1 without sacrificing running time. The paper also demonstrates that the compression method is suitable for use with the Hilbert space-filling curve, a mechanism widely used in multi-dimensional indexing.
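
The abstract does not describe the algorithm in detail, so the following is only a minimal sketch of the general idea behind tuple difference coding, not the authors' implementation. It assumes that each record is a tuple of integer dimension codes, that the per-dimension cardinalities are known, and that records are ordered by a simple mixed-radix (lexicographic) ordinal rather than by the Hilbert ordinal used in the paper. Measure values and bit-level packing are omitted, and all function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Record = Tuple[int, ...]


def tuple_to_ordinal(record: Sequence[int], cardinalities: Sequence[int]) -> int:
    """Map a record to a single integer using mixed-radix positional encoding.

    Assumption: a lexicographic ordinal stands in for the Hilbert ordinal.
    """
    ordinal = 0
    for value, card in zip(record, cardinalities):
        if not 0 <= value < card:
            raise ValueError(f"value {value} outside dimension range 0..{card - 1}")
        ordinal = ordinal * card + value
    return ordinal


def ordinal_to_tuple(ordinal: int, cardinalities: Sequence[int]) -> Record:
    """Invert tuple_to_ordinal by peeling off digits from the last dimension."""
    values = []
    for card in reversed(cardinalities):
        ordinal, value = divmod(ordinal, card)
        values.append(value)
    return tuple(reversed(values))


def compress_block(records: Sequence[Record], cardinalities: Sequence[int]) -> List[int]:
    """Encode one block as its first ordinal followed by successive differences."""
    ordinals = sorted(tuple_to_ordinal(r, cardinalities) for r in records)
    if not ordinals:
        return []
    return [ordinals[0]] + [b - a for a, b in zip(ordinals, ordinals[1:])]


def decompress_block(encoded: Sequence[int], cardinalities: Sequence[int]) -> List[Record]:
    """Rebuild the sorted records of a block by accumulating the differences."""
    records = []
    running = 0
    for delta in encoded:
        running += delta
        records.append(ordinal_to_tuple(running, cardinalities))
    return records


if __name__ == "__main__":
    cards = (4, 3, 5)  # hypothetical dimension cardinalities
    block = [(1, 2, 3), (0, 0, 4), (1, 2, 4), (3, 0, 0)]
    encoded = compress_block(block, cards)
    print(encoded)
    print(decompress_block(encoded, cards))
```

Because each block stores its first ordinal in full, a block can be decoded without reading any other block. Neighbouring records in a sorted block tend to have small differences, which is what allows them to be stored in fewer bits than the original tuples.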

Published in Data Science Journal

ISSN: 1683-1470 (Online)
Publisher: Ubiquity Press
Country of publisher: United Kingdom
LCC subjects: Science: Science (General)
Website: http://datascience.codata.org/
