Optimizing predictive precision in imbalanced datasets for actionable revenue change prediction

被引:12
作者
Mahajan, Pravar Dilip [1 ]
Maurya, Abhinav [1 ,2 ]
Megahed, Aly [1 ]
Elwany, Alaa [3 ]
Strong, Ray [1 ]
Blomberg, Jeanette [1 ]
机构
[1] IBM Res Almaden, San Jose, CA 95120 USA
[2] Amazon Web Serv, Seattle, WA 98121 USA
[3] Texas A&M Univ, Dept Ind & Syst Engn, College Stn, TX 77843 USA
关键词
Analytics; Revenue change prediction; Classification; Machine learning; Bayesian optimization; Imbalanced datasets; CUSTOMER CHURN PREDICTION; TELECOMMUNICATION SECTOR; AUC OPTIMIZATION; MACHINE; SMOTE;
D O I
10.1016/j.ejor.2020.02.036
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In business environments where an organization offers contract-based periodic services to its clients, one crucial task is to predict changes in revenues generated through different clients or specific service offerings from one time epoch to another. This is commonly known as the revenue change prediction problem. In practical real-world environments, the importance of having adequate revenue change prediction capability primarily stems from scarcity of resources (in particular, sales team personnel or technical consultants) that are needed to respond to different revenue change scenarios including predicted revenue growth or shrinkage. It becomes important to make actionable decisions; that is, decisions related to prioritizing clients or service offerings to which these scarce resources are to be allocated. The contribution of the current work is twofold. First, we propose a framework for conducting revenue change prediction through casting it as a classification problem. Second, since datasets associated with revenue change prediction are typically imbalanced, we develop a new methodology for solving the classification problem such that we achieve maximum prediction precision while minimizing sacrifice in prediction accuracy. We validate our proposed framework through real-world datasets acquired from a major global provider of cloud computing services, and benchmark its performance against standard classifiers from previous works in the literature. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:1095 / 1113
页数:19
相关论文
共 55 条
[1]   Customer churn prediction in the telecommunication sector using a rough set approach [J].
Amin, Adnan ;
Anwar, Sajid ;
Adnan, Awais ;
Nawaz, Muhammad ;
Alawfi, Khalid ;
Hussain, Amir ;
Huang, Kaizhu .
NEUROCOMPUTING, 2017, 237 :242-254
[2]  
[Anonymous], COMPUTATIONAL INTELL
[3]  
[Anonymous], 1995, ENCY MACHINE LEARNIN, DOI DOI 10.1007/978-0-387-34870-422
[4]  
[Anonymous], 2014, INT J COMPUT BUS RES
[5]  
[Anonymous], CTI RES S
[6]  
[Anonymous], KDD CUP 2004 PART PH
[7]   Semi-Supervised Deep Fuzzy C-Mean Clustering for Imbalanced Mulit-Class Classification [J].
Arshad, Ali ;
Riaz, Saman ;
Jiao, Licheng .
IEEE ACCESS, 2019, 7 :28100-28112
[8]   An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme [J].
Bi, Jingjun ;
Zhang, Chongsheng .
KNOWLEDGE-BASED SYSTEMS, 2018, 158 :81-93
[9]  
Bishop C.M., 2006, J ELECTRON IMAGING, V16, P049901, DOI [DOI 10.5194/NHESS-18-2769-2018, 10.1117/1.2819119]
[10]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771