IEEE Access (Jan 2019)

An Empirical Study on the Effectiveness of Feature Selection for Cross-Project Defect Prediction

  • Qiao Yu,
  • Junyan Qian,
  • Shujuan Jiang,
  • Zhenhua Wu,
  • Gongjie Zhang

DOI
https://doi.org/10.1109/ACCESS.2019.2895614
Journal volume & issue
Vol. 7
pp. 35710 – 35718

Abstract

Read online

Software defect prediction has attracted much attention of researchers in software engineering. At present, feature selection approaches have been introduced into software defect prediction, which can improve the performance of traditional defect prediction (known as within-project defect prediction, WPDP) effectively. However, the studies on feature selection are not sufficient for cross-project defect prediction (CPDP). In this paper, we use the feature subset selection and feature ranking approaches to explore the effectiveness of feature selection for CPDP. An empirical study is conducted on NASA and PROMISE datasets. The results show that both the feature subset selection and feature ranking approaches can improve the performance of CPDP. Therefore, we should select the representative feature subset or set a reasonable proportion of selected features to improve the performance of CPDP in future studies.

Keywords