IEEE Access (Jan 2020)

PTAOD: A Novel Framework for Supporting Approximate Outlier Detection Over Streaming Data for Edge Computing

  • Rui Zhu,
  • Tiantian Yu,
  • Zhiyuan Tan,
  • Wei Du,
  • Liang Zhao,
  • Jiajia Li,
  • Xiufeng Xia

DOI
https://doi.org/10.1109/ACCESS.2019.2962066
Journal volume & issue
Vol. 8
pp. 1475 – 1485

Abstract

Read online

Outlier detection over sliding window is a fundamental problem in the domain of streaming data management, which has has been studied over 10 years. The key to supporting outlier detection is to construct a neighbour list for each object, which is used for predicting which objects may become outliers or are impossible to become outliers. However, existing work ignores the fact that, outliers amount is usually small, in which it is unnecessary to construct neighbour-list for all objects when they arrive in the window. It causes both high space and computational cost, which turns the solution infeasible for working under edge computation environment. In this paper, we propose a novel framework named PTAOD (Probabilistic Threshold-based Approximate Outlier Detection). Firstly, we propose an algorithm for evaluating the probability of a newly arrived object becoming an outlier before it expires from the window, using evaluating result for avoiding unnecessary candidate maintenance. In addition, we introduce a novel index namely ZHB-Tree (Z-order-based Hash B-Tree) to maintain streaming data. Last of all, we propose a novel algorithm to maintain candidate outliers. Theoretical analysis and extensive experimental results demonstrate the effectiveness of the proposed algorithms.

Keywords