Discover Applied Sciences (Apr 2025)

Enhancing predictive maintenance in automotive industry: addressing class imbalance using advanced machine learning techniques

  • Yashashree Mahale,
  • Shrikrishna Kolhar,
  • Anjali S. More

DOI
https://doi.org/10.1007/s42452-025-06827-3
Journal volume & issue
Vol. 7, no. 4
pp. 1 – 21

Abstract

Read online

Abstract Predictive maintenance is an important application in the automotive industry to enhance vehicle reliability and reducing operational downtime. However, the major challenge with the predictive maintenance types of datasets is the class imbalance, where failure instances are scarce. In this study, a binary classification task is experimented leveraging advanced data imbalance handling techniques to predict the failure instances. The on-board diagnostic dataset utilized has only 16.3% of the failure data, and to address this, 3 key approaches were explored: [i] synthetic minority oversampling technique (SMOTE), [ii] cost-sensitive learning, [iii] ensemble methods. Six machine learning models, including logistic regression, support vector machine, decision tree, and random forest, along with gradient boosting algorithms using extreme gradient boost (XGBoost) and light gradient boosting machine frameworks, were implemented. SMOTE showed a statistically significant improvement in the F1-score with a significance value of 0.0010 and ROC-AUC with p = 0.0015, confirming the effectiveness of SMOTE in handling class imbalance. Advanced ensemble models like RUSBoost, balanced bagging, easy-ensemble, and balanced random forest were implemented where XGBoost yielded the highest performance, attaining scores of 1.000. for precision, recall and F1-score. SMOTE improved minority class representation, achieving a f1-score of up to 99.54%. The integration of multiple imbalance-handling techniques demonstrated superior performances. The scope of the study is limited to engine parameters; however, the results demonstrate the effectiveness of combining traditional and ensemble classifiers to handle imbalance. This work provides a reliable framework with potential applications in fleet management, automotive manufacturing, and smart vehicle diagnostics and also extends beyond engine performance monitoring to environmental and safety assessments.

Keywords