Frontiers in Physiology (Oct 2023)

Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data

  • Jia Li,
  • Jia Li,
  • Jiangwei Li,
  • Chenxu Wang,
  • Chenxu Wang,
  • Fons J. Verbeek,
  • Tanja Schultz,
  • Hui Liu

DOI
https://doi.org/10.3389/fphys.2023.1233341
Journal volume & issue
Vol. 14

Abstract

Read online

As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods.

Keywords