Enhancing Customer Churn Prediction With Resampling: A Comparative Study

被引:0
|
作者
Ong, Jia-Xuan [1 ]
Tong, Gee-Kok [1 ]
Khor, Kok-Chin [2 ]
Haw, Su-Cheng [1 ]
机构
[1] Multimedia Univ, Fac Comp & Informat, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia
[2] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Jalan Sungai Long, Bandar Sungai Long 43000, Kajang, Malaysia
来源
TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS | 2024年 / 13卷 / 03期
关键词
Customer churn prediction; imbalance datasets; resampling; oversampling;
D O I
10.18421/TEM133-20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this competitive business world, accurately predicting customer churn is crucial to maintaining and preventing revenue loss. However, due to the imbalanced nature of customer churn data, traditional machine learning algorithms often fail to identify churned customers accurately. This has led to exploring resampling techniques, demonstrating their efficacy in addressing this issue. However, current studies in the customer churn prediction field frequently overlook the untapped potential of comprehensive investigation and comparison of resampling techniques. Instead of exploring and comparing various resampling methods, many studies predominantly rely on a single resampling method, such as SMOTE. Hence, this paper aims to compare and evaluate the effectiveness of several resampling methods, including oversampling, undersampling, and hybrid techniques. We utilized the benchmark dataset, telecommunication customer churn, from IBM Watson, where approximately 26.5% of the customers have churned, indicating that the data is imbalanced. Our results demonstrate that the combination of random forest with a hybrid sampling method - SMOTE-ENN obtained the best result. The combination yields an F1 score of 95.3% and an accuracy of 96.0%, surpassing the studies that utilized the same dataset. This highlights the benefits of comparing resampling techniques in predicting customer churn, specifically in imbalanced datasets.
引用
收藏
页码:1927 / 1936
页数:10
相关论文
共 50 条
  • [21] Just-in-time customer churn prediction in the telecommunication sector
    Amin, Adnan
    Al-Obeidat, Feras
    Shah, Babar
    Al Tae, May
    Khan, Changez
    Durrani, Hamood Ur Rehman
    Anwar, Sajid
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (06): : 3924 - 3948
  • [22] Supervised Massive Data Analysis for Telecommunication Customer Churn Prediction
    Li, Hui
    Yang, Deliang
    Yang, Lingling
    Lu, Yao
    Lin, Xiaola
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 163 - 169
  • [23] Just-in-time customer churn prediction in the telecommunication sector
    Adnan Amin
    Feras Al-Obeidat
    Babar Shah
    May Al Tae
    Changez Khan
    Hamood Ur Rehman Durrani
    Sajid Anwar
    The Journal of Supercomputing, 2020, 76 : 3924 - 3948
  • [24] Leveraging TabNet for Enhanced Customer Churn Prediction in the Telecommunication Industry
    Alhakim, Muhammad Firdaus
    Petchhan, Jirayu
    Su, Shun-Feng
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 717 - 718
  • [25] Research of Indicator System in Customer Churn Prediction for Telecom Industry
    Qiu Yihui
    Zhang Chiyu
    2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE), 2016, : 123 - 130
  • [26] Customer Churn Prediction in B2B Contexts
    Figalist, Iris
    Elsner, Christoph
    Bosch, Jan
    Olsson, Helena Holmstrom
    SOFTWARE BUSINESS (ICSOB 2019), 2019, 370 : 378 - 386
  • [27] UNDERSTANDING CUSTOMER CHURN PREDICTION RESEARCH WITH STRUCTURAL TOPIC MODELS
    Fridrich, Martin
    ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, 2020, 54 (04): : 301 - 317
  • [28] Enhancing Customer Churn Prediction in the Banking Sector through Hybrid Segmented Models with Model-Agnostic Interpretability Techniques
    Vashistha, Astha
    Tiwari, Anoop Kumar
    Ghai, Shubhdeep Singh
    Yadav, Paritosh Kumar
    Pandey, Sudhakar
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2024,
  • [29] Variable selection by association rules for customer churn prediction of multimedia on demand
    Tsai, Chih-Fong
    Chen, Mao-Yuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (03) : 2006 - 2015
  • [30] Customer Churn Prediction Approach Based on LLM Embeddings and Logistic Regression
    Chajia, Meryem
    Nfaoui, El Habib
    FUTURE INTERNET, 2024, 16 (12)