A Novel Profit Maximizing Metric for Measuring Classification Performance of Customer Churn Prediction Models

被引:110
作者
Verbraken, Thomas [1 ]
Verbeke, Wouter [2 ]
Baesens, Bart [1 ,3 ]
机构
[1] Katholieke Univ Leuven, Dept Decis Sci & Informat Management, B-3000 Louvain, Belgium
[2] Univ Edinburgh, Sch Business, Edinburgh EH8 9JS, Midlothian, Scotland
[3] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England
关键词
Data mining; classification; performance measures; DECISION TABLES; NETWORKS; ACCURACY; AREA;
D O I
10.1109/TKDE.2012.50
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interest for data mining techniques has increased tremendously during the past decades, and numerous classification techniques have been applied in a wide range of business applications. Hence, the need for adequate performance measures has become more important than ever. In this paper, a cost-benefit analysis framework is formalized in order to define performance measures which are aligned with the main objectives of the end users, i.e., profit maximization. A new performance measure is defined, the expected maximum profit criterion. This general framework is then applied to the customer churn problem with its particular cost-benefit structure. The advantage of this approach is that it assists companies with selecting the classifier which maximizes the profit. Moreover, it aids with the practical implementation in the sense that it provides guidance about the fraction of the customer base to be included in the retention campaign.
引用
收藏
页码:961 / 973
页数:13
相关论文
共 29 条
[1]   On learning algorithm selection for classification [J].
Ali, S ;
Smith, KA .
APPLIED SOFT COMPUTING, 2006, 6 (02) :119-138
[2]  
[Anonymous], 2006, Introduction to Data Mining
[3]  
[Anonymous], INT JOINT C ART INT
[4]   Using neural network rule extraction and decision tables for credit-risk evaluation [J].
Baesens, B ;
Setiono, R ;
Mues, C ;
Vanthienen, J .
MANAGEMENT SCIENCE, 2003, 49 (03) :312-329
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   Toward intelligent assistance for a data mining process: An ontology-based approach for cost-sensitive classification [J].
Bernstein, A ;
Provost, F ;
Hill, S .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) :503-518
[7]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[8]   CRM at a pay-TV company: Using analytical models to reduce customer attrition by targeted marketing for subscription services [J].
Burez, Jonathan ;
Van den Poel, Dirk .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) :277-288
[9]  
Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
[10]   Machine learning for direct marketing response models: Bayesian networks with evolutionary programming [J].
Cui, G ;
Wong, ML ;
Lui, HK .
MANAGEMENT SCIENCE, 2006, 52 (04) :597-612