Computers (Feb 2020)

A Taxonomy of Techniques for SLO Failure Prediction in Software Systems

  • Johannes Grohmann,
  • Nikolas Herbst,
  • Avi Chalbani,
  • Yair Arian,
  • Noam Peretz,
  • Samuel Kounev

DOI
https://doi.org/10.3390/computers9010010
Journal volume & issue
Vol. 9, no. 1
p. 10

Abstract

Read online

Failure prediction is an important aspect of self-aware computing systems. Therefore, a multitude of different approaches has been proposed in the literature over the past few years. In this work, we propose a taxonomy for organizing works focusing on the prediction of Service Level Objective (SLO) failures. Our taxonomy classifies related work along the dimensions of the prediction target (e.g., anomaly detection, performance prediction, or failure prediction), the time horizon (e.g., detection or prediction, online or offline application), and the applied modeling type (e.g., time series forecasting, machine learning, or queueing theory). The classification is derived based on a systematic mapping of relevant papers in the area. Additionally, we give an overview of different techniques in each sub-group and address remaining challenges in order to guide future research.

Keywords