Predicting customer retention and profitability by using random forests and regression forests techniques

被引:198
作者
Larivière, B [1 ]
Van den Poel, D [1 ]
机构
[1] Univ Ghent, Dept Mkt, B-9000 Ghent, Belgium
关键词
data mining; customer relationship management; customer retention and profitability; random forests and regression forests;
D O I
10.1016/j.eswa.2005.04.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an era of strong customer relationship management (CRM) emphasis, firms strive to build valuable relationships with their existing customer base. In this study, we attempt to better understand three important measures of customer outcome: next buy, partial-defection and customers' profitability evolution. By means of random forests techniques we investigate a broad set of explanatory variables, including past customer behavior, observed customer heterogeneity and some typical variables related to intermediaries. We analyze a real-life sample of 100,000 customers taken from the data warehouse of a large European financial services company. Two types of random forests techniques are employed to analyze the data: random forests are used for binary classification, whereas regression forests are applied for the models with linear dependent variables. Our research findings demonstrate that both random forests techniques provide better fit for the estimation and validation sample compared to ordinary linear regression and logistic regression models. Furthermore, we find evidence that the same set of variables have a different impact on buying versus defection versus profitability behavior. Our findings suggest that past customer behavior is more important to generate repeat purchasing and favorable profitability evolutions, while the intermediary's role has a greater impact on the customers' defection proneness. Finally, our results demonstrate the benefits of analyzing different customer outcome variables simultaneously, since an extended investigation of the next buy-partial-defection-customer profitability triad indicates that one cannot fully understand a particular outcome without understanding the other related behavioral outcome variables. (C) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:472 / 484
页数:13
相关论文
共 30 条
[1]  
[Anonymous], INT J BANK MARKETING
[2]  
[Anonymous], 1999, KIDS MARKET MYTHS RE
[3]   Customer satisfaction cues to support market segmentation and explain switching behavior [J].
Athanassopoulos, AD .
JOURNAL OF BUSINESS RESEARCH, 2000, 47 (03) :191-207
[4]   Bayesian network classifiers for identifying the slope of the customer lifecycle of long-life customers [J].
Baesens, B ;
Verstraeten, G ;
Van den Poel, D ;
Egmont-Petersen, M ;
Van Kenhove, P ;
Vanthienen, J .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 156 (02) :508-523
[5]   Bayesian neural network learning for repeat purchase modelling in direct marketing [J].
Baesens, B ;
Viaene, S ;
Van den Poel, D ;
Vanthienen, J ;
Dedene, G .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2002, 138 (01) :191-211
[6]   When customers are members: Customer retention in paid membership contexts [J].
Bhattacharya, CB .
JOURNAL OF THE ACADEMY OF MARKETING SCIENCE, 1998, 26 (01) :31-44
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting [J].
Buckinx, W ;
Van den Poel, D .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 164 (01) :252-268
[9]   Implementing a customer relationship strategy: The asymmetric impact of poor versus excellent execution [J].
Colgate, MR ;
Danaher, PJ .
JOURNAL OF THE ACADEMY OF MARKETING SCIENCE, 2000, 28 (03) :375-387
[10]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845