Customer Churn Prediction Using Decision Tree and Random Forest with a SMOTE Approach to Support Customer Intelligence in the Telecommunications Industry
Keywords:
Customer Churn, Data Mining, Decision Tree, Random Forest, SMOTE, Customer Intelligence.Abstract
Customer churn represents a critical issue in the telecommunications sector due to its impact on customer retention levels and long-term business continuity. Consequently, companies require reliable methods to recognize customers who are likely to terminate their subscriptions. This study focuses on predicting customer churn by applying Decision Tree and Random Forest algorithms to the Telco Customer Churn dataset. The research methodology adopts the CRISP-DM framework, which encompasses data preparation, feature engineering, class balancing through the Synthetic Minority Over-sampling Technique (SMOTE), model construction, and performance evaluation. Four classification approaches were examined, including Decision Tree Gini, Decision Tree Entropy, Decision Tree Pre-Pruning, and Random Forest. Hyperparameter tuning was performed using GridSearchCV, whereas model effectiveness was assessed through Accuracy, Precision, Recall, F1-Score, and ROC-AUC metrics. The experimental results reveal that the Random Forest model produced the highest performance, achieving an accuracy of 84.88% and a ROC-AUC value of 92.94%. Furthermore, the feature importance analysis identified Contract, MonthlyCharges, and tenure as the variables with the strongest contribution to churn prediction. These findings indicate that the proposed approach can enhance Customer Intelligence by generating valuable insights into customer behavior and assisting organizations in developing more effective customer retention initiatives
References
Ahmad, A. K., Jafar, A., & Aljouie, A. (2023). Customer Churn Prediction Using Machine Learning Approaches. IEEE Access.
Alotaibi, F., & Haq, E. U. (2024). Customer Churn Prediction for Telecommunication Industry Using Machine Learning and Ensemble Methods. Engineering, Technology & Applied Science
Breiman, L. (2023). Random Forests. Springer.
Fernandez, A., Garcia, S., & Galar, M. (2022). Learning from Imbalanced Data Sets. Springer.
Han, J., Kamber, M., & Pei, J. (2022). Data Mining: Concepts and Techniques. Morgan Kaufmann.
Huang, Y., Zhang, H., & Li, X. (2023). Customer Churn Prediction Using Machine Learning Techniques. Electronics.
Ullah, M. (2022). Customer Churn Prediction Using SMOTE and Machine Learning. Applied Sciences.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Muhammad Irfan Fauzi, Nurul Fitriyah, Mufidah Karimah (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Creative Commons Attribution 4.0 International (CC BY 4.0).


This work is licensed under a