Hybrid Models Using Unsupervised Clustering for Prediction of Customer Churn

Cited by: 39
Authors
Bose, Indranil [2]
Chen, Xi [1]
Affiliations
[1] Zhejiang Univ, Sch Management, Hangzhou 310003, Zhejiang, Peoples R China
[2] Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
Keywords
churn; clustering; data mining; decision trees; lift; prediction; rules; ALGORITHM; INFORMATION; MAPS;
DOI
10.1080/10919390902821291
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Churn management is one of the key issues handled by mobile telecommunication operators. Data mining techniques can help in the prediction of churn behavior of customers. Various supervised learning techniques have been used to study customer churn. However, research on the use of unsupervised learning techniques for prediction of churn is limited. In this article, we use two-stage hybrid models consisting of unsupervised clustering techniques and decision trees with boosting on two different data sets and evaluate the models in terms of top decile lift. We examine two different approaches for hybridization of the models for utilizing the results of clustering based on various attributes related to services usage and revenue contribution of customers. The results indicate that the use of clustering led to improved top decile lift for the hybrid models compared with the benchmark case when no clustering is used. It is also shown that using cluster labels as inputs to the decision trees is a preferred method of hybridization. Out of the five unsupervised clustering techniques used, none is found to dominate others. But interesting attributes and rules that can help marketing experts identify churners from the data are obtained from the best hybrid models.
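The abstract evaluates models by top decile lift but does not spell out the computation. A minimal sketch follows, assuming the common definition: the churn rate among the 10% of customers with the highest predicted churn scores, divided by the churn rate of the whole customer base. The function name `top_decile_lift` and the sample scores and labels are hypothetical illustrations, not data or code from the article.

```python
def top_decile_lift(scores, labels):
    """Churn rate in the top 10% of customers (ranked by predicted score)
    divided by the overall churn rate.

    Assumed definition; the article's exact procedure is not given in the abstract.
    scores: predicted churn scores, higher means more likely to churn
    labels: 1 for an actual churner, 0 otherwise
    """
    if len(scores) != len(labels) or len(scores) == 0:
        raise ValueError("scores and labels must be non-empty and equal in length")

    overall_rate = sum(labels) / len(labels)
    if overall_rate == 0:
        raise ValueError("lift is undefined when there are no churners")

    # Rank customers from highest to lowest predicted score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    # Size of the top decile (at least one customer).
    k = max(1, len(ranked) // 10)
    top_rate = sum(label for _, label in ranked[:k]) / k

    return top_rate / overall_rate


# Hypothetical data: 20 customers, 4 of whom churn (overall churn rate 0.2).
scores = [0.95, 0.90, 0.80, 0.70] + [0.5 - 0.01 * i for i in range(16)]
labels = [1, 1, 0, 1, 0, 1] + [0] * 14

lift = top_decile_lift(scores, labels)
print(lift)  # the two highest-scored customers are both churners: 1.0 / 0.2 = 5.0
```

A lift of 1.0 means the model ranks churners no better than random selection; larger values mean churners are concentrated among the highest-scored customers.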
Pages: 133-151
Number of pages: 19