IEEE Access (Jan 2021)

A Hybrid Feature Selection Method RFSTL for Manufacturing Quality Prediction Based on a High Dimensional Imbalanced Dataset

  • Hong Zhou,
  • Kun-Ming Yu,
  • Yen-Chiu Chen,
  • Huan-Po Hsu

DOI
https://doi.org/10.1109/ACCESS.2021.3059298
Journal volume & issue
Vol. 9
pp. 29719 – 29735

Abstract

Read online

Under Industry 4.0, manufacturing quality prediction has been gaining increased interest from researchers and manufacturers. From the analysis of previous studies on quality predictions using machine learning, it became clear that the high dimensionality and imbalance of data are major and common problems affecting the learning performance. This work uses a hybrid method to address this issue, applying a Synthetic Minority Oversampling Technique & TomekLinks balancing approach to create balanced data and using Random Forest as the feature selecting measurement to reduce the dimensionality of data. In addition, a Fine Gaussian Support Vector Machine (Fine Gaussian SVM) based on the representative set of features selected by the hybrid method utilized is employed in this work to predict product quality. The results of experimentation demonstrate that the hybrid method proposed in this work performs well for manufacturing quality prediction and offers a simple, quick and powerful way to address the problem of feature selection encountered by the imbalanced classification.

Keywords