A Literature Survey and Experimental Evaluation of the State-of-the-Art in Uplift Modeling: A Stepping Stone Toward the Development of Prescriptive Analytics

被引:74
作者
Devriendt, Floris [1 ,2 ]
Moldovan, Darie [3 ]
Verbeke, Wouter [1 ,2 ]
机构
[1] Vrije Univ Brussel, Fac Econ & Social Sci, B-1050 Brussels, Belgium
[2] Vrije Univ Brussel, Solvay Business Sch, B-1050 Brussels, Belgium
[3] Babes Bolyai Univ, Business Informat Syst Dept, Cluj Napoca, Romania
关键词
uplift modeling; prescriptive analytics; literature survey; experimental evaluation; performance measures; profit-driven analytics; RANDOMIZED-TRIALS; NONCOMPLIANCE;
D O I
10.1089/big.2017.0104
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prescriptive analytics extends on predictive analytics by allowing to estimate an outcome in function of control variables, allowing as such to establish the required level of control variables for realizing a desired outcome. Uplift modeling is at the heart of prescriptive analytics and aims at estimating the net difference in an outcome resulting from a specific action or treatment that is applied. In this article, a structured and detailed literature survey on uplift modeling is provided by identifying and contrasting various groups of approaches. In addition, evaluation metrics for assessing the performance of uplift models are reviewed. An experimental evaluation on four real-world data sets provides further insight into their use. Uplift random forests are found to be consistently among the best performing techniques in terms of the Qini and Gini measures, although considerable variability in performance across the various data sets of the experiments is observed. In addition, uplift models are frequently observed to be unstable and display a strong variability in terms of performance across different folds in the cross-validation experimental setup. This potentially threatens their actual use for business applications. Moreover, it is found that the available evaluation metrics do not provide an intuitively understandable indication of the actual use and performance of a model. Specifically, existing evaluation metrics do not facilitate a comparison of uplift models and predictive models and evaluate performance either at an arbitrary cutoff or over the full spectrum of potential cutoffs. In conclusion, we highlight the instability of uplift models and the need for an application-oriented approach to assess uplift models as prime topics for further research.
引用
收藏
页码:13 / 41
页数:29
相关论文
共 49 条
[1]  
[Anonymous], 2014, DATA MIN KNOWL DISC
[2]  
[Anonymous], 2006, THESIS
[3]  
[Anonymous], 1980, J Roy Stat Soc: Ser C (Appl Stat), DOI [DOI 10.2307/2986296, 10.2307/2986296]
[4]  
Baesens B., 2014, ANAL BIG DATA WORLD
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Breiman L, 1998, ANN STAT, V26, P801
[9]  
Cao Y, 2017, MIDW SAS US GROUP C, pBF03
[10]  
Chickering D.M., 2000, PROCEEDINGS OF THE SIXTEENTH CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, UAI'00, P82