IEEE Access (Jan 2024)

Customer Personality Analysis for Churn Prediction Using Hybrid Ensemble Models and Class Balancing Techniques

  • Noman Ahmad,
  • Mazhar Javed Awan,
  • Haitham Nobanee,
  • Azlan Mohd Zain,
  • Ansar Naseem,
  • Amena Mahmoud

DOI
https://doi.org/10.1109/ACCESS.2023.3334641
Journal volume & issue
Vol. 12
pp. 1865 – 1879

Abstract

Read online

Today’s businesses rely heavily on focused marketing to improve their chances of growing and keeping their consumer base. Internet behemoths like Google and Facebook have expanded their business models around targeted advertisements that support business growth. Customer personality identification helps for churn prediction for companies. This problem arises in several companies where customer leaves companies for many reasons. This gap leads to conduct study for customer personality analysis. The collected dataset was highly imbalanced in nature. Two class balancing approaches CTGAN (Conditional tabular Generative adversarial networks) and SMOTE (Synthetic minority oversampling technique) has been utilized to equalize the both classes. There are three ensemble approaches such as bagging, boosting and stacking have been utilized for modeling purpose bagging approach uses Random Forest (RF) boosting utilizes XGBoost (XGB), Light Gradient Boosting Machine (LGBM) and ADA Boost (ADA B). The proposed Hybrid Model HSLR comprises of RF, XGB, ADA Boost, LGBM approaches as base classifiers and LR as a Meta classifier. Three testing independent set, k-fold with 5 and 10 folds have been utilized. To evaluate the performance of classifiers evaluation metrics such as Accuracy score, Precision, Recall, F1 score, MCC and ROC score have been utilized. The SMOTE generated data has shown results as compare with CTGAN generated data. The SMOTE approach has shown the highest results of 94.06, 94.23, 94.28, 94.05, 88.13 and 0.984 as accuracy score, Precision, recall, F1, MCC and Roc score respectively.

Keywords