Leveraging fine-grained mobile data for churn detection through Essence Random Forest

被引:5
作者
Colot, Christian [1 ]
Baecke, Philippe [2 ]
Linden, Isabelle [1 ]
机构
[1] Univ Namur, Dept Business Adm, Namur, Belgium
[2] Vlerick Business Sch, Ghent, Belgium
关键词
Telecom data; Random Forest; Customer churn; Customer analytics; Unstructured data; Probability models; HIGH-DIMENSIONAL DATA; MAXIMUM RELEVANCE; SOCIAL-INFLUENCE; PREDICTION; SELECTION; CLASSIFICATION; ALGORITHMS; NETWORKS; ENSEMBLE; OTT;
D O I
10.1186/s40537-021-00451-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The rise of unstructured data leads to unprecedented opportunities for marketing applications along with new methodological challenges to leverage such data. In particular, redundancy among the features extracted from this data deserves special attention as it might prevent current methods to benefit from it. In this study, we propose to investigate the value of multiple fine-grained data sources i.e. websurfing, use of applications and geospatial mobility for churn detection within telephone companies. This value is analysed both in substitution and in complement to the value of the well-known communication network. What is more, we also suggest an adaptation of the Random Forest algorithm called Essence Random Forest designed to better address redundancy among extracted features. Analysing fine-grained data of a telephone company, we first find that geo-spatial mobility data might be a good long term alternative to the classical communication network that might become obsolete due to the competition with digital communications. Then, we show that, on the short term, these alternative fine-grained data might complement the communication network for an improved churn detection. In addition, compared to Random Forest and Extremely Randomized Trees, Essence Random Forest better leverages the value of unstructured data by offering an enhanced churn detection regardless of the addressed perspective i.e. substitution or complement. Finally, Essence Random Forest converges faster to stable results which is a salient property in a resource constrained environment.
引用
收藏
页数:26
相关论文
共 59 条
  • [21] EcmPred: Prediction of extracellular matrix proteins based on random forest with maximum relevance minimum redundancy feature selection
    Kandaswamy, Krishna Kumar
    Pugalenthi, Ganesan
    Kalies, Kai-Uwe
    Hartmann, Enno
    Martinetz, Thomas
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 317 : 377 - 383
  • [22] Heterogeneous oblique random forest
    Katuwal, Rakesh
    Suganthan, P. N.
    Zhang, Le
    [J]. PATTERN RECOGNITION, 2020, 99 (99)
  • [23] Kumar S, 2019, BIOFILMS IN HUMAN DISEASES: TREATMENT AND CONTROL, P1, DOI [10.1080/17597269.2019.1647375, 10.1007/978-3-030-30757-8_1]
  • [24] Kyrillidis Anastasios, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P4548, DOI 10.1109/ICASSP.2014.6854463
  • [25] Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research
    Lessmann, Stefan
    Baesens, Bart
    Seow, Hsin-Vonn
    Thomas, Lyn C.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 247 (01) : 124 - 136
  • [26] Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS
    Li, Bi-Qing
    Feng, Kai-Yan
    Chen, Lei
    Huang, Tao
    Cai, Yu-Dong
    [J]. PLOS ONE, 2012, 7 (08):
  • [27] Predicting interpurchase time in a retail environment using customer-product networks: An empirical study and evaluation
    Lismont, Jasmien
    Ram, Sudha
    Vanthienen, Jan
    Lemahieu, Wilfried
    Baesens, Bart
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 104 : 22 - 32
  • [28] Analysis and prediction of drug-drug interaction by minimum redundancy maximum relevance and incremental feature selection
    Liu, Lili
    Chen, Lei
    Zhang, Yu-Hang
    Wei, Lai
    Cheng, Shiwen
    Kong, Xiangyin
    Zheng, Mingyue
    Huang, Tao
    Cai, Yu-Dong
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (02) : 312 - 329
  • [29] Louppe G., 2014, Understanding random forests
  • [30] Latent Homophily or Social Influence? An Empirical Analysis of Purchase Within a Social Network
    Ma, Liye
    Krishnan, Ramayya
    Montgomery, Alan L.
    [J]. MANAGEMENT SCIENCE, 2015, 61 (02) : 454 - 473