IEEE Access (Jan 2024)

Advancing Bankruptcy Forecasting With Hybrid Machine Learning Techniques: Insights From an Unbalanced Polish Dataset

  • Ummey Hany Ainan,
  • Lip Yee Por,
  • Yen-Lin Chen,
  • Jing Yang,
  • Chin Soon Ku

DOI
https://doi.org/10.1109/ACCESS.2024.3354173
Journal volume & issue
Vol. 12
pp. 9369 – 9381

Abstract

Read online

The challenge of bankruptcy prediction, critical for averting financial sector losses, is amplified by the prevalence of imbalanced datasets, which often skew prediction models. Addressing this, our study introduces the innovative hybrid model XGBoost+ANN, designed to leverage the strengths of both ensemble learning and artificial neural networks. This model integrates a comprehensive set of features with parameters optimized through genetic algorithms, eschewing traditional feature selection approaches. Our research focuses on an unbalanced dataset of Polish companies and reveals that the XGBoost+ANN model, in particular, exhibits outstanding performance. Optimized using genetic algorithms and without feature selection, this model achieved the highest AUC (0.958), sensitivity (0.752), and accuracy (0.983) scores, surpassing other models in our study. This remarkable outperformance, along with the robust results, marks a substantial advancement in the field of bankruptcy prediction. It underscores the efficacy of our approach in addressing the persistent challenge of data imbalance, offering a more reliable and accurate solution for financial risk assessment.

Keywords