IEEE Access (Jan 2020)

Incremental Attribute Reduction Method Based on Chi-Square Statistics and Information Entropy

  • Na Su,
  • Xinjun An,
  • Changqing Yan,
  • Shujuan Ji

DOI
https://doi.org/10.1109/ACCESS.2020.2997013
Journal volume & issue
Vol. 8
pp. 98234 – 98243

Abstract

Read online

Attributes in datasets are usually not equally significant. Some attributes are unnecessary or redundant. Attribute reduction is an important research issue of rough set theory, which can find minimum subsets of attributes with the same classification effect as the whole dataset by removing unnecessary or redundant attributes. We use Chi-square statistics to evaluate the significance of condition attributes. It can reduce the search space of attribute reduction and improve the speed of attribute reduction. Conditional entropy of relative attributes is adopted as a heuristic function. Two decision table reduction algorithms, forward selection and backward deletion, are proposed to approach the optimal solution. Based on this, an efficient incremental attribute reduction method for dynamically changing datasets is proposed by preserving intermediate variables. The intermediate variable is the observation frequency matrix of joint events of each condition attribute and decision attribute. Experimental results show that the proposed algorithms can improve performance in terms of processing time.

Keywords