A Big Data-Driven Hybrid Model for Enhancing Streaming Service Customer Retention Through Churn Prediction Integrated With Explainable AI

被引:2
作者
Gani Joy, Usman [1 ]
Hoque, Kazi Ekramul [1 ]
Nazim Uddin, Mohammed [1 ]
Chowdhury, Linkon [1 ]
Park, Seung-Bo [2 ]
机构
[1] East Delta Univ, Sch Sci Engn & Technol, Chattagram 4209, Bangladesh
[2] Inha Univ, Dept Software Convergence Engn, Incheon 22212, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Artificial intelligence; classification algorithms; deep learning; decision support systems; explainable AI; model interpretation; semi-supervised learning; big data analysis; NEURAL-NETWORKS; SELECTION;
D O I
10.1109/ACCESS.2024.3401247
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Customer churn prediction is a critical issue that streaming services face as retaining existing subscribers is vital to the success of the business. Creating reliable churn prediction models is important because the costs of acquiring new customers are usually higher than those involved in retaining existing ones. In this study, we propose a big data-driven hybrid model combining a deep neural network with a machine-learning model to efficiently forecast customer churn. Our proposed model uses Long Short-Term Memory (LSTM) with a Gated Recurrent Unit (GRU) to capture the trends in subscribers' usage patterns over time. In addition, light gradient boosting (Light GBM) is used to leverage insights from sequential modeling along with original attributes to forecast churn. Moreover, feature selection techniques like Chi-squared testing and Sequential Feature Selection (SFS) are utilized to choose the optimum set of features for our proposed model. Furthermore, several individual models, including deep learning and traditional machine learning algorithms are also evaluated and compared with our proposed hybrid model. Additionally, the study illustrates model interpretations using Shapley Additive Explanations (SHAP) and Explainable Boosting Machine (EBM) which are used for identifying influential features in streaming services enhancing customer retention efforts. These techniques provide transparency into our proposed model's forecasting, making them more actionable and understandable for decision-makers. Extensive experimental evaluation demonstrates the hybrid model achieves best-in-class performance with 95.60% AUC and 90.09% F1 score.
引用
收藏
页码:69130 / 69150
页数:21
相关论文
共 67 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Customer churn prediction in telecom using machine learning in big data platform
    Ahmad, Abdelrahim Kasem
    Jafar, Assef
    Aljoumaa, Kadan
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [3] Ahmed A. A. Q, 2017, IOSR-JCE, V19, P30
  • [4] Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance
    Ahsan, Md Manjurul
    Mahmud, M. A. Parvez
    Saha, Pritom Kumar
    Gupta, Kishor Datta
    Siddique, Zahed
    [J]. TECHNOLOGIES, 2021, 9 (03)
  • [5] Customer Churn Prediction in telecommunication Industry: with and without Counter-Example
    Amin, Adnan
    Khan, Changez
    Ali, Imtiaz
    Anwar, Sajid
    [J]. 2014 EUROPEAN NETWORK INTELLIGENCE CONFERENCE (ENIC), 2014, : 134 - 137
  • [6] Assessing the data complexity of imbalanced datasets
    Barella, Victor H.
    Garcia, Luis P. F.
    de Souto, Marcilio C. P.
    Lorena, Ana C.
    de Carvalho, Andre C. P. L. F.
    [J]. INFORMATION SCIENCES, 2021, 553 : 83 - 109
  • [7] Batista GEAPA., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735, 10.1145/1007730.1007735.2]
  • [8] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Handling class imbalance in customer churn prediction
    Burez, J.
    Van den Poel, D.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4626 - 4636