Robust Factorization Machines for User Response Prediction

被引:14
作者
Punjabi, Surabhi [1 ]
Bhatt, Priyanka [1 ]
机构
[1] WalmartLabs, Bangalore, Karnataka, India
来源
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018) | 2018年
关键词
Factorization Machines; Field-aware Factorization Machines; Robust Optimization; Computational Advertising; Response Prediction; Interval Uncertainty;
D O I
10.1145/3178876.3186148
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Factorization machines (FMs) are a state-of-the-art model class for user response prediction in the computational advertising domain. Rapid growth of internet and mobile device usage has given rise to multiple customer touchpoints. This coupled with factors like high cookie churn rate results in a fragmented view of user activity at the advertiser's end. Current literature assumes procured user signals as the absolute truth, which is contested by the absence of deterministic identity linkage across a user's multiple avatars. In this work, we characterize the data uncertainty using Robust Optimization (RO) paradigm to design approaches that are immune against perturbations. We propose two novel algorithms: robust factorization machine (RFM) and its field-aware variant (RFFM), under interval uncertainty. These formulations are generic and can find applicability in any classification setting under noise. We provide a distributed and scalable Spark implementation using parallel stochastic gradient descent. In the experiments conducted on three real-world datasets, the robust counterparts outperform the baselines significantly under perturbed settings. Our experimental findings reveal interesting connections between choice of uncertainty set and the noise-pro ofness of resulting models.
引用
收藏
页码:669 / 678
页数:10
相关论文
共 28 条
  • [21] Pan Z, 2016, IEEE DATA MINING, P400, DOI [10.1109/ICDM.2016.0051, 10.1109/ICDM.2016.53]
  • [22] Qu YR, 2016, IEEE DATA MINING, P1149, DOI [10.1109/ICDM.2016.0151, 10.1109/ICDM.2016.57]
  • [23] Rendle Steffen, 2010, Proceedings 2010 10th IEEE International Conference on Data Mining (ICDM 2010), P995, DOI 10.1109/ICDM.2010.127
  • [24] Linking Users Across Domains with Location Data: Theory and Validation
    Riederer, Chris
    Kim, Yunsung
    Chaintreau, Augustin
    Korula, Nitish
    Lattanzi, Silvio
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 707 - 719
  • [25] Tuzlukov V, 2002, ELECTR ENG APPL SIGN
  • [26] Wang Jun, 2016, ABS161003013 CORR
  • [27] Apache Spark: A Unified Engine for Big Data Processing
    Zaharia, Matei
    Xin, Reynold S.
    Wendell, Patrick
    Das, Tathagata
    Armbrust, Michael
    Dave, Ankur
    Meng, Xiangrui
    Rosen, Josh
    Venkataraman, Shivaram
    Franklin, Michael J.
    Ghodsi, Ali
    Gonzalez, Joseph
    Shenker, Scott
    Stoica, Ion
    [J]. COMMUNICATIONS OF THE ACM, 2016, 59 (11) : 56 - 65
  • [28] GB-CENT: Gradient Boosted Categorical Embedding and Numerical Trees
    Zhao, Qian
    Shi, Yue
    Hong, Liangjie
    [J]. PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 1311 - 1319