Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction

被引:137
作者
Pi, Qi [1 ]
Bian, Weijie [1 ]
Zhou, Guorui [1 ]
Zhu, Xiaoqiang [1 ]
Gai, Kun [1 ]
机构
[1] Alibaba Grp, Beijing, Peoples R China
来源
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2019年
关键词
Click-Through Rate Prediction; User Behavior Modeling;
D O I
10.1145/3292500.3330666
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Click-through rate (CTR) prediction is critical for industrial applications such as recommender system and online advertising. Practically, it plays an important role for CTR modeling in these applications by mining user interest from rich historical behavior data. Driven by the development of deep learning, deep CTR models with ingeniously designed architecture for user interest modeling have been proposed, bringing remarkable improvement of model performance over offline metric. However, great efforts are needed to deploy these complex models to online serving system for realtime inference, facing massive traffic request. Things turn to be more difficult when it comes to long sequential user behavior data, as the system latency and storage cost increase approximately linearly with the length of user behavior sequence. In this paper, we face directly the challenge of long sequential user behavior modeling and introduce our hands-on practice with the co-design of machine learning algorithm and online serving system for CTR prediction task. (i) From serving system view, we decouple the most resource-consuming part of user interest modeling from the entire model by designing a separate module named UIC (User Interest Center). UIC maintains the latest interest state for each user, whose update depends on realtime user behavior trigger event, rather than on traffic request. Hence UIC is latency free for realtime CTR prediction. (ii) From machine learning algorithm view, we propose a novel memory-based architecture named MIMN (Multi-channel user Interest Memory Network) to capture user interests from long sequential behavior data, achieving superior performance over state-of-the-art models. MIMN is implemented in an incremental manner with UIC module. Theoretically, the co-design solution of UIC and MIMN enables us to handle the user interest modeling with unlimited length of sequential behavior data. Comparison between model performance and system efficiency proves the effectiveness of proposed solution. To our knowledge, this is one of the first industrial solutions that are capable of handling long sequential user behavior data with length scaling up to thousands. It now has been deployed in the display advertising system in Alibaba.
引用
收藏
页码:2671 / 2679
页数:9
相关论文
共 30 条
  • [1] [Anonymous], LEARNING TREE BASED
  • [2] [Anonymous], 2016, ARXIV160506065
  • [3] [Anonymous], P 33 AAAI C ART INT
  • [4] [Anonymous], 2015, ARXIV151106939
  • [5] [Anonymous], DENSELY CONNECTED CO
  • [6] [Anonymous], 2015, ADV NEURAL INFORM PR
  • [7] [Anonymous], 2014, 3 INT C LEARN REPR
  • [8] [Anonymous], TAIR DISTRIBUTED KEY
  • [9] [Anonymous], 2016, P 1 WORKSH DEEP LEAR, DOI DOI 10.1145/2988450.2988454
  • [10] [Anonymous], 2018, ARXIV180410862