Energy-Aware Selective Inference Task Offloading for Real-Time Edge Computing Applications

Abdelkarim Ben Sada; Amar Khelloufi; Abdenacer Naouri; Huansheng Ning; Sahraoui Dhelim

doi:10.1109/ACCESS.2024.3404272

IEEE Access (Jan 2024)

Energy-Aware Selective Inference Task Offloading for Real-Time Edge Computing Applications

Abdelkarim Ben Sada,
Amar Khelloufi,
Abdenacer Naouri,
Huansheng Ning,
Sahraoui Dhelim

Affiliations

Abdelkarim Ben Sada: School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Amar Khelloufi: ORCiD; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Abdenacer Naouri: ORCiD; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Huansheng Ning: ORCiD; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Sahraoui Dhelim: ORCiD; School of Computer Science, University College Dublin, Dublin 4, Ireland

DOI: https://doi.org/10.1109/ACCESS.2024.3404272
Journal volume & issue: Vol. 12
pp. 72924 – 72937

Abstract

Read online

IoT has recently witnessed a boom in AI deployment at the edge as a result of the newly developed small size Machine Learning (ML) models and integrated hardware accelerators. Although it brings huge benefits such as privacy-preserving and low-latency applications, it still suffers from typical resource limitations of edge devices. A new approach aims to deploy multiple inference models varying in size and accuracy onboard the edge device which could alleviate some of these limitations. This dynamic system can be leveraged to provide real-time energy efficient application by smartly allocating inference tasks to inference local models or offload to edge servers based on current constraints. In this work, we tackle the problem of efficiently allocating inference models for a given set of inference tasks between local inference models and edge server models in parallel under given time and energy constraints. This problem is considered strongly NP-hard and therefore we propose LITOSS, a 2-stage framework in which we use a lightweight Genetic Algorithm-based schemer for task scheduling along with a Reinforcement Learning (RL) agent for improving edge server selection. We perform experiments using a raspberry pi with a set of edge servers. Results show that our framework performed relatively faster compared to other meta-heuristic schemes such as LGSTO, Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) while providing higher average accuracy. We also show that using an RL agent to select the best subset of available edge servers increased, or maintained in worst cases, the average accuracy while reducing the average scheduling times.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords