Identifying Soccer Players on Facebook Through Predictive Analytics

被引:8
作者
Bogaert, Matthias [1 ]
Ballings, Michel [2 ]
Hosten, Martijn [1 ]
Van den Poel, Dirk [1 ]
机构
[1] Univ Ghent, Dept Mkt, B-9000 Ghent, Belgium
[2] Univ Tennessee, Dept Business Analyt & Stat, Knoxville, TN 37996 USA
关键词
Facebook; sports; predictive models; social media; sensitivity analysis; LOGISTIC-REGRESSION; NEURAL-NETWORKS; CLASSIFICATION; ENSEMBLE; CLASSIFIERS; MODELS; SYSTEM; FOREST; SPORT; TREES;
D O I
10.1287/deca.2017.0354
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This study assesses the feasibility of identifying self-reported sports practitioners (soccer players) on Facebook. The main goal is to develop a system to support marketers with the decision as to which prospects to target for advertising purposes. To do so, we benchmark several algorithms (i.e., random forest, logistic regression, adaboost, rotation forest, neural networks, and kernel factory) using five times twofold cross-validation. To evaluate performance and variable importances, we build a fusion model, which combines the results of the other algorithms using the weighted average. This technique is also referred to as information-fusion sensitivity analysis. The results reveal that Facebook data provide a viable basis to come up with sports predictions as the predictive performance ranges from 72.01% to 80.43% for area under the receiver operating characteristic curve (AUC), from 81.96% to 83.95% for accuracy, and from 2.41 to 3.06 for top-decile lift. Our benchmark study indicates that stochastic adaboost, the fusion model, random forest, rotation forest, and regularized logistic regression are the best-performing algorithms. Furthermore, the results show that the most important variables are the average number of friends that play soccer, membership of a soccer group, and the number of favorite teams. We also assess the impact of our results on profitability by conducting a thorough sensitivity analysis. Our analysis reveals that our approach can be beneficial for a wide range of companies. The analysis and results in this study will assist sports brands with decisions regarding their implementation of targeted marketing approaches.
引用
收藏
页码:274 / 297
页数:24
相关论文
共 85 条
[1]  
[Anonymous], DEL FOOTB MON LEAG 2
[2]  
[Anonymous], 2011, WORKSHOP MUSIC RECOM
[3]   Serglycin-deficient cytotoxic T lymphocytes display defective secretory granule maturation and granzyme B storage [J].
Grujic, M ;
Braga, T ;
Lukinius, A ;
Eloranta, ML ;
Knight, SD ;
Pejler, G ;
Åbrink, M .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (39) :33411-33418
[4]  
Baatarjav EA, 2008, LECT NOTES COMPUT SC, V5333, P211
[5]   What you see may not be what you get: A brief, nontechnical introduction to overfitting in regression-type models [J].
Babyak, MA .
PSYCHOSOMATIC MEDICINE, 2004, 66 (03) :411-421
[6]   NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA [J].
BALDI, P ;
HORNIK, K .
NEURAL NETWORKS, 1989, 2 (01) :53-58
[7]  
Ballings M., 2015, R PACKAGE ROTATIONFO
[8]   Evaluating multiple classifiers for stock price direction prediction [J].
Ballings, Michel ;
Van den Poel, Dirk ;
Hespeels, Nathalie ;
Gryp, Ruben .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (20) :7046-7056
[9]   CRM in social media: Predicting increases in Facebook usage frequency [J].
Ballings, Michel ;
Van den Poel, Dirk .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 244 (01) :248-260
[10]   Kernel Factory: An ensemble of kernel machines [J].
Ballings, Michel ;
Van den Poel, Dirk .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (08) :2904-2913