Journal of Applied Mathematics (Jan 2015)

Data Transformation Technique to Improve the Outlier Detection Power of Grubbs’ Test for Data Expected to Follow Linear Relation

  • K. K. L. B. Adikaram,
  • M. A. Hussein,
  • M. Effenberger,
  • T. Becker

DOI
https://doi.org/10.1155/2015/708948
Journal volume & issue
Vol. 2015

Abstract

Read online

Grubbs test (extreme studentized deviate test, maximum normed residual test) is used in various fields to identify outliers in a data set, which are ranked in the order of x1≤x2≤x3≤⋯≤xn (i=1,2,3,…,n). However, ranking of data eliminates the actual sequence of a data series, which is an important factor for determining outliers in some cases (e.g., time series). Thus in such a data set, Grubbs test will not identify outliers correctly. This paper introduces a technique for transforming data from sequence bound linear form to sequence unbound form (y=c). Applying Grubbs test to the new transformed data set detects outliers more accurately. In addition, the new technique improves the outlier detection capability of Grubbs test. Results show that, Grubbs test was capable of identifing outliers at significance level 0.01 after transformation, while it was unable to identify those prior to transforming at significance level 0.05.