Adaptive Provisioning of Heterogeneous Cloud Resources for Big Data Processing

Maarten Kollenstart; Edwin Harmsma; Erik Langius; Vasilios Andrikopoulos; Alexander Lazovik

doi:10.3390/bdcc2030015

Big Data and Cognitive Computing (Jul 2018)

Affiliations

Maarten Kollenstart: Monitoring and Control Systems, TNO Groningen, Eemsgolaan 3, 9727 DW Groningen, The Netherlands
Edwin Harmsma: Monitoring and Control Systems, TNO Groningen, Eemsgolaan 3, 9727 DW Groningen, The Netherlands
Erik Langius: Monitoring and Control Systems, TNO Groningen, Eemsgolaan 3, 9727 DW Groningen, The Netherlands
Vasilios Andrikopoulos: Faculty of Science and Engineering, University of Groningen, Nijenborgh 9, 9747 AG Groningen, The Netherlands
Alexander Lazovik: Faculty of Science and Engineering, University of Groningen, Nijenborgh 9, 9747 AG Groningen, The Netherlands

DOI: https://doi.org/10.3390/bdcc2030015
Journal volume & issue: Vol. 2, no. 3, p. 15

Abstract

Efficient utilization of resources plays an important role in the performance of large-scale task processing. When heterogeneous types of resources are used within the same application, it is hard to achieve good utilization of all of the different resource types. However, the overall utilization can be improved by taking advantage of recent developments in cloud infrastructure that enable the use of dynamic clusters of resources, and by dynamically altering the size of the available resources for each resource type. Starting from this premise, this paper discusses a solution that aims to provide a generic algorithm to estimate the desired ratios of instance processing tasks, as well as the ratios of the resources used by these instances, without the need for trial runs or a priori knowledge of the execution steps. These ratios are then used as part of an adaptive system that is able to reconfigure itself to maximize utilization. To verify the solution, a reference framework is implemented that adaptively manages clusters of functionally different VMs to host a calculation scenario. Experiments are conducted on a compute-heavy use case in which the probability of underground pipeline failures is determined based on the settlement of soils. These experiments show that the solution is capable of eliminating large amounts of under-utilization, resulting in increased throughput and lower lead times.

Published in Big Data and Cognitive Computing

ISSN: 2504-2289 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology
Website: http://www.mdpi.com/journal/BDCC
