IEEE Access (Jan 2020)
Incremental Attribute Reduction Method Based on Chi-Square Statistics and Information Entropy
Abstract
Attributes in datasets are usually not equally significant. Some attributes are unnecessary or redundant. Attribute reduction is an important research issue of rough set theory, which can find minimum subsets of attributes with the same classification effect as the whole dataset by removing unnecessary or redundant attributes. We use Chi-square statistics to evaluate the significance of condition attributes. It can reduce the search space of attribute reduction and improve the speed of attribute reduction. Conditional entropy of relative attributes is adopted as a heuristic function. Two decision table reduction algorithms, forward selection and backward deletion, are proposed to approach the optimal solution. Based on this, an efficient incremental attribute reduction method for dynamically changing datasets is proposed by preserving intermediate variables. The intermediate variable is the observation frequency matrix of joint events of each condition attribute and decision attribute. Experimental results show that the proposed algorithms can improve performance in terms of processing time.
Keywords