GSaaS: A Service to Cloudify and Schedule GPUs

Sergio Iserte; Raul Pena-Ortiz; Juan Gutierrez-Aguado; Jose M. Claver; Rafael Mayo

doi:10.1109/ACCESS.2018.2855261

IEEE Access (Jan 2018)

GSaaS: A Service to Cloudify and Schedule GPUs

Sergio Iserte,
Raul Pena-Ortiz,
Juan Gutierrez-Aguado,
Jose M. Claver,
Rafael Mayo

Affiliations

Sergio Iserte: ORCiD; Department of Computer Science and Engineering, Universitat Jaume I, Castelló de la Plana, Spain
Raul Pena-Ortiz: Department of Computer Science, Universitat de València, Burjassot, Spain
Juan Gutierrez-Aguado: ORCiD; Department of Computer Science, Universitat de València, Burjassot, Spain
Jose M. Claver: Department of Computer Science, Universitat de València, Burjassot, Spain
Rafael Mayo: Department of Computer Science and Engineering, Universitat Jaume I, Castelló de la Plana, Spain

DOI: https://doi.org/10.1109/ACCESS.2018.2855261
Journal volume & issue: Vol. 6
pp. 39762 – 39774

Abstract

Read online

Cloud technology is an attractive infrastructure solution that provides customers with an almost unlimited on-demand computational capacity using a pay-per-use approach, and allows data centers to increase their energy and economic savings by adopting a virtualized resource sharing model. However, resources such as graphics processing units (GPUs), have not been fully adapted to this model. Although, general-purpose computing on graphics processing units (GPGPU) is becoming more and more popular, cloud providers lack of flexibility to manage accelerators, because of the extended use of peripheral component interconnect (PCI) passthrough techniques to attach GPUs to virtual machines (VMs). For this reason, we design, develop, and evaluate a service that provides a complete management of cloudified GPUs (cGPUs) in public cloud platforms. Our solution enables an effective, anonymous, and transparent access from VMs to cGPUs that are previously scheduled and assigned by a full resource manager, taking into account new GPU selection policies and new working modes based on the locality of the physical accelerators and the exclusivity when accessing them. This easy-to-adopt tool improves the resource availability through different cGPUs configurations for end-users, whilst cloud providers are able to achieve a better utilization of their infrastructures and offer more competitive services. Scalability results in a real cloud environment demonstrate that our solution introduces a virtually null overhead in the deployment of VMs. Besides, performance experiments reveal that GPU-enabled clusters based on cloud infrastructures can benefit from our proposal not only exploiting better the accelerators, but also serving more jobs requests per unit of time.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords