Algorithms (Sep 2012)

Monitoring Threshold Functions over Distributed Data Streams with Node Dependent Constraints

  • Yaakov Malinovsky
  • Jacob Kogan

DOI
https://doi.org/10.3390/a5030379
Journal volume & issue
Vol. 5, no. 3
pp. 379 – 397

Abstract

Monitoring data streams in a distributed system has attracted considerable interest in recent years. The task of feature selection (e.g., by monitoring the information gain of various features) requires very high communication overhead when addressed with straightforward centralized algorithms. Most existing algorithms monitor simple aggregated values, such as the frequency of occurrence of stream items. Motivated by recent contributions based on geometric ideas, we present an alternative approach that enables monitoring the values of an arbitrary threshold function over distributed data streams through stream-dependent constraints applied separately to each stream. We report numerical experiments on real-world data that detect instances where communication between nodes is required, and we compare the approach and the results with those recently reported in the literature.

Keywords