FedCTR: Federated Native Ad CTR Prediction with Cross-platform User Behavior Data

被引:18
作者
Wu, Chuhan [1 ]
Wu, Fangzhao [2 ]
Lyu, Lingjuan [3 ]
Huang, Yongfeng [1 ]
Xie, Xing [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Sony AI, Minato Ku, 1-7-1 Konan, Tokyo 1080075, Japan
关键词
Native Ad; CTR Prediction; Federated Learning; Privacy-preserving;
D O I
10.1145/3506715
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Native ad is a popular type of online advertisement that has similar forms with the native content displayed on websites. Native ad click-through rate (CTR) prediction is useful for improving user experience and platform revenue. However, it is challenging due to the lack of explicit user intent, and user behaviors on the platform with native ads may be insufficient to infer users' interest in ads. Fortunately, user behaviors exist on many online platforms that can provide complementary information for user-interest mining. Thus, leveraging multi-platform user behaviors is useful for native ad CTR prediction. However, user behaviors are highly privacy-sensitive, and the behavior data on different platforms cannot be directly aggregated due to user privacy concerns and data protection regulations. Existing CTR prediction methods usually require centralized storage of user behavior data for user modeling, which cannot be directly applied to the CTR prediction task with multi-platform user behaviors. In this article, we propose a federated native ad CTR prediction method named FedCTR, which can learn user-interest representations from cross-platform user behaviors in a privacy-preserving way. On each platform a local user model learns user embeddings from the local user behaviors on that platform. The local user embeddings from different platforms are uploaded to a server for aggregation, and the aggregated ones are sent to the ad platform for CTR prediction. Besides, we apply local differential privacy and differential privacy to the local and aggregated user embeddings, respectively, for better privacy protection. Moreover, we propose a federated framework for collaborative model training with distributed models and user behaviors. Extensive experiments on real-world dataset show that FedCTR can effectively leverage multi-platform user behaviors for native ad CTR prediction in a privacy-preserving manner.
引用
收藏
页数:19
相关论文
共 45 条
  • [1] Neural CTR Prediction for Native Ad
    An, Mingxiao
    Wu, Fangzhao
    Wang, Heyuan
    Di, Tao
    Huang, Jianqiang
    Xie, Xing
    [J]. CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 600 - 612
  • [2] [Anonymous], 2008, WWW
  • [3] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [4] Chen J., 2016, P 24 ACM INT C MULT, P811, DOI [DOI 10.1145/2964284.2964325, https://doi.org/10.1145/2964284.2964325]
  • [5] Cheng H.-. T., 2016, P 1 WORKSH DEEP LEAR, P7, DOI DOI 10.1145/2988450.2988454
  • [6] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [7] Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
  • [8] Feng SW, 2020, Arxiv, DOI arXiv:2001.11154
  • [9] Feng YF, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2301
  • [10] Guo HF, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1725