Topological data analysis in digital marketing

被引:2
作者
Lakshminarayan, Choudur [1 ,2 ]
Yin, Mingzhang [1 ]
机构
[1] Univ Texas Austin, Dept Stat & Data Sci, Austin, TX USA
[2] Teradata Labs, Austin, TX 78759 USA
关键词
clickstreams; computational topology; landscapes; Markov chains; persistence diagrams; silhouettes;
D O I
10.1002/asmb.2563
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The ubiquitous internet is a multipurpose platform for finding information, an avenue for social interaction, and a primary customer touch-point as a marketplace to conduct e-commerce. The digital footprints of browsers are a rich source of data to drive sales. We use clickstreams (clicks) to track the evolution of session-level customer browsing for modeling. We apply Markov chains (MC) to calculate probabilities of page-level transitions from which relevant topological features (persistence diagrams) are extracted to determine optimal points (URL pages) for marketing intervention. We use topological summaries (silhouettes, landscapes) to distinguish thebuyersandnonbuyersto determine the likelihood of conversion of active user sessions. Separately, we model browsing patterns via Markov chain theory to predict users' propensity to buy within a session. Extensive analysis of data applied to a large commercial website demonstrates that the proposed approaches are useful predictors of user behavior and intent. Utilizing computational topology in digital marketing holds tremendous promise. We demonstrate the utility of topological data analysis combined with MC and present its merits and disadvantages.
引用
收藏
页码:1014 / 1028
页数:15
相关论文
共 16 条
[1]  
[Anonymous], 2004, ACM Transactions on Internet Technology, DOI DOI 10.1145/990301.990304
[2]   Evaluating variable-length Markov chain models for analysis of user Web navigation sessions [J].
Borges, Jose ;
Levene, Mark .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (04) :441-452
[3]  
Brusilovsky P., 2007, LECT NOTES COMPUTER
[4]  
Bubenik P., 2015, ARXIV12076437
[5]  
Chazal F., 2014, SOCG 14, P474
[6]  
Edelsbrunner H, 2008, CONTEMP MATH, V453, P257
[7]  
Efron B., 1993, MONOGR STAT APPL PRO, DOI [10.1111/1467-9639.00050, 10.1007/978-1-4899-4541-9]
[8]  
Eirinaki M, 2005, P 7 ACM INT WORKSH W, P2
[9]  
Fasy B. T., 2015, ARXIV14111830
[10]   Using persistent homology and dynamical distances to analyze protein binding [J].
Kovacev-Nikolic, Violeta ;
Bubenik, Peter ;
Nikolic, Dragan ;
Heo, Giseon .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2016, 15 (01) :19-38