Applied Sciences (Aug 2022)

Intelligent Decision Forest Models for Customer Churn Prediction

  • Fatima Enehezei Usman-Hamza,
  • Abdullateef Oluwagbemiga Balogun,
  • Luiz Fernando Capretz,
  • Hammed Adeleye Mojeed,
  • Saipunidzam Mahamad,
  • Shakirat Aderonke Salihu,
  • Abimbola Ganiyat Akintola,
  • Shuib Basri,
  • Ramoni Tirimisiyu Amosa,
  • Nasiru Kehinde Salahdeen

DOI
https://doi.org/10.3390/app12168270
Journal volume & issue
Vol. 12, no. 16
p. 8270

Abstract

Read online

Customer churn is a critical issue impacting enterprises and organizations, particularly in the emerging and highly competitive telecommunications industry. It is important to researchers and industry analysts interested in projecting customer behavior to separate churn from non-churn consumers. The fundamental incentive is a firm’s intent desire to keep current consumers, along with the exorbitant expense of gaining new ones. Many solutions have been developed to address customer churn prediction (CCP), such as rule-based and machine learning (ML) solutions. However, the issue of scalability and robustness of rule-based customer churn solutions is a critical drawback, while the imbalanced nature of churn datasets has a detrimental impact on the prediction efficacy of conventional ML techniques in CCP. As a result, in this study, we developed intelligent decision forest (DF) models for CCP in telecommunication. Specifically, we investigated the prediction performances of the logistic model tree (LMT), random forest (RF), and Functional Trees (FT) as DF models and enhanced DF (LMT, RF, and FT) models based on weighted soft voting and weighted stacking methods. Extensive experimentation was performed to ascertain the efficacy of the suggested DF models utilizing publicly accessible benchmark telecom CCP datasets. The suggested DF models efficiently distinguish churn from non-churn consumers in the presence of the class imbalance problem. In addition, when compared to baseline and existing ML-based CCP methods, comparative findings showed that the proposed DF models provided superior prediction performances and optimal solutions for CCP in the telecom industry. Hence, the development and deployment of DF-based models for CCP and applicable ML tasks are recommended.

Keywords