Feature-Interaction-Enhanced Sequential Transformer for Click-Through Rate Prediction

被引:0
作者
Yuan, Quan [1 ]
Zhu, Ming [2 ]
Li, Yushi [2 ]
Liu, Haozhe [2 ]
Guo, Siao [2 ]
机构
[1] China Univ Geosci, Sch Mech Engn & Elect Informat, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Hubei Key Lab Smart Internet Technol, Wuhan 430074, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 07期
基金
中国国家自然科学基金;
关键词
click-through-rate prediction; feature interaction; sequential recommendation; sequence pooling; self-attention;
D O I
10.3390/app14072760
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Click-through rate (CTR) prediction plays a crucial role in online services and applications, such as online shopping and advertising. The performance of CTR prediction can have a direct impact on user experience and the revenue of the online platforms. For CTR prediction models, self-attention-based methods have been widely applied to this field. Recent works generally adopted the Transformer architecture, where the self-attention mechanism can capture the global dependencies of the user's historical interactions and predict the next item. Despite the effectiveness of self-attention methods in modeling sequential user behaviors, most sequential recommenders hardly exploit feature interaction techniques to extract high-order feature combinations. In this paper, we propose a Feature-Interaction-Enhanced Sequence Model (FESeq), which integrates feature interaction and the sequential recommendation model in a cascading structure. Specifically, the interacting layer in FESeq is an automatic feature engineering step for the Transformer model. Then, we add a linear time interval embedding layer and a positional embedding layer to the Transformer in the sequence-refiner layer to learn both the time intervals and the position information in the user's sequence behaviors. We also design an attention-based sequence pooling layer that can model the relevance of the user's historical behaviors and the target item representation through scaled bilinear attention. Our experiments show that the proposed method beats all the baselines on both public and industrial datasets.
引用
收藏
页数:24
相关论文
共 46 条
  • [41] A Simple Convolutional Generative Network for Next Item Recommendation
    Yuan, Fajie
    Karatzoglou, Alexandros
    Arapakis, Ioannis
    Jose, Joemon M.
    He, Xiangnan
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 582 - 590
  • [42] Zhou C, 2017, Arxiv, DOI arXiv:1711.06632
  • [43] Zhou GR, 2019, AAAI CONF ARTIF INTE, P5941
  • [44] Deep Interest Network for Click-Through Rate Prediction
    Zhou, Guorui
    Zhu, Xiaoqiang
    Song, Chengru
    Fan, Ying
    Zhu, Han
    Ma, Xiao
    Yan, Yanghui
    Jin, Junqi
    Li, Han
    Gai, Kun
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1059 - 1068
  • [45] FINAL: Factorized Interaction Layer for CTR Prediction
    Zhu, Jieming
    Jia, Qinglin
    Cai, Guohao
    Dai, Quanyu
    Li, Jingjie
    Dong, Zhenhua
    Tang, Ruiming
    Zhang, Rui
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2006 - 2010
  • [46] Open Benchmarking for Click-Through Rate Prediction
    Zhu, Jieming
    Liu, Jinyang
    Yang, Shuai
    Zhang, Qi
    He, Xiuqiang
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2759 - 2769