Water (Jul 2021)

Anomaly Detection Using a Sliding Window Technique and Data Imputation with Machine Learning for Hydrological Time Series

  • Lattawit Kulanuwat,
  • Chantana Chantrapornchai,
  • Montri Maleewong,
  • Papis Wongchaisuwat,
  • Supaluk Wimala,
  • Kanoksri Sarinnapakorn,
  • Surajate Boonya-aroonnet

DOI
https://doi.org/10.3390/w13131862
Journal volume & issue
Vol. 13, no. 13
p. 1862

Abstract

Read online

Water level data obtained from telemetry stations typically contains large number of outliers. Anomaly detection and a data imputation are necessary steps in a data monitoring system. Anomaly data can be detected if its values lie outside of a normal pattern distribution. We developed a median-based statistical outlier detection approach using a sliding window technique. In order to fill anomalies, various interpolation techniques were considered. Our proposed framework exhibited promising results after evaluating with F1-score and root mean square error (RMSE) based on our artificially induced data points. The present system can also be easily applied to various patterns of hydrological time series with diverse choices of internal methods and fine-tuned parameters. Specifically, the Spline interpolation method yielded a superior performance on non-cyclical data while the long short-term memory (LSTM) outperformed other interpolation methods on a distinct tidal data pattern.

Keywords