An empirical comparison of techniques for the class imbalance problem in churn prediction

被引：106

作者：

Zhu, Bing ^{[1
,2
]}

Baesens, Bart ^{[2
,3
]}

vanden Broucke, Seppe K. L. M. ^{[2
]}

机构：

[1] Sichuan Univ, Business Sch, Chengdu 610064, Peoples R China

[2] Katholieke Univ Leuven, Dept Decis Sci & Informat Management, B-3000 Leuven, Belgium

[3] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England

来源：

INFORMATION SCIENCES | 2017年 / 408卷

基金：

中国国家自然科学基金;

关键词：

Churn prediction; Class imbalance; Benchmark experiment; Expected maximum profit measure; CUSTOMER CHURN; CLASSIFICATION; FRAMEWORK; MACHINE;

D O I：

10.1016/j.ins.2017.04.015

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Class imbalance brings significant challenges to customer churn prediction. Many solutions have been developed to address this issue. In this paper, we comprehensively compare the performance of state-of-the-art techniques to deal with class imbalance in the context of churn prediction. A recently developed expected maximum profit criterion is used as one of the main performance measures to offer more insights from the perspective of cost-benefit. The experimental results show that the applied evaluation metric has a great impact on the performance of techniques. An in-depth exploration of reaction patterns to different measures is conducted by intra-family comparison within each solution group and global comparison among the representative techniques from different groups. The results also indicate there is much space to improve solutions' performance in terms of profit-based measure. Our study offers valuable insights for academics and professionals and it also provides a baseline to develop new methods for dealing with class imbalance in churn prediction. (C) 2017 Elsevier Inc. All rights reserved.

引用

页码：84 / 99

页数：16

共 46 条

[1] Dynamic churn prediction framework with more effective use of rare event data: The case of private banking [J].