Handling Imbalanced Data in Customer Churn Prediction Using Combined Sampling and Weighted Random Forest

被引:0
作者
Effendy, Veronikha
Adiwijaya
Baizal, Z. K. A.
机构
来源
2014 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT) | 2014年
关键词
Churn; Prediction; Weighted Random Forest; Combined-sampling; simple under sampling; SMOTE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Customer churn is a major problem that is found in the telecommunications industry because it affects the company's revenue. At the time of the customer churn is taking place, the percentage of data that describes the customer churn is usually low. Unfortunately, the churn data is the data which have to be predicted earlier. The lack of data on customer churn led to the problem of imbalanced data. The imbalanced data caused difficulties in developing a good prediction model. This research applied a combination of sampling techniques and Weighted Random Forest (WRF) to improve the customer churn prediction model on a sample dataset from a telecommunication industry in Indonesia. WRF claimed can produce a prediction model which has a good performance on the imbalanced data problem. However, this research found that the performance of the prediction model developed by WRF using the dataset is still quite low. Sampling techniques were applied to overcome this problem. This research used the combination of simple under sampling and SMOTE. The result shown that the combined-sampling and WRF could produce a prediction model which had better performance than before.
引用
收藏
页数:6
相关论文
共 11 条
[1]  
[Anonymous], 2006, Introduction to Data Mining
[2]  
[Anonymous], 2004, USING RANDOM FOREST
[3]  
Baizal dan Z. A., 2009, ANAL PENGARUH METODE
[4]  
Breimann L., 2001, RANDOM FOREST
[5]  
Burez J., 2009, EXPERT SYSTEMS APPL, V36, P4626
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   Customer churn prediction in telecommunications [J].
Huang, Bingquan ;
Kechadi, Mohand Tahar ;
Buckley, Brian .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) :1414-1425
[8]  
Maharaj D. M., 2011, IJBSM INT J BUSINESS, V1
[9]  
Mattison R., 2005, The Telco Churn Management Handbook
[10]   Defection detection: Measuring and understanding the predictive accuracy of customer churn models [J].
Neslin, SA ;
Gupta, S ;
Kamakura, W ;
Lu, JX ;
Mason, CH .
JOURNAL OF MARKETING RESEARCH, 2006, 43 (02) :204-211