Songklanakarin Journal of Science and Technology (SJST) (Oct 2017)

Boundary expansion algorithm of a decision tree induction for an imbalanced dataset

  • Kesinee Boonchuay,
  • Krung Sinapiromsaran,
  • Chidchanok Lursinsap

DOI
https://doi.org/10.14456/sjst-psu.2017.82
Journal volume & issue
Vol. 39, no. 5
pp. 665 – 673

Abstract

Read online

A decision tree is one of the famous classifiers based on a recursive partitioning algorithm. This paper introduces the Boundary Expansion Algorithm (BEA) to improve a decision tree induction that deals with an imbalanced dataset. BEA utilizes all attributes to define non-splittable ranges. The computed means of all attributes for minority instances are used to find the nearest minority instance, which will be expanded along all attributes to cover a minority region. As a result, BEA can successfully cope with an imbalanced dataset comparing with C4.5, Gini, asymmetric entropy, top-down tree, and Hellinger distance decision tree on 25 imbalanced datasets from the UCI Repository.

Keywords