A new interest extraction method based on multi-head attention mechanism for CTR prediction

被引:4
|
作者
Yang, Haifeng [1 ]
Yao, Linjing [1 ]
Cai, Jianghui [1 ,2 ]
Wang, Yupeng [1 ]
Zhao, Xujun [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Waliu Rd, Taiyuan 030024, Peoples R China
[2] North Univ China, Sch Comp Sci & Technol, Xueyuan Rd, Taiyuan 030051, Peoples R China
基金
中国国家自然科学基金;
关键词
Recommendation system; Multi-head attention; Feature interaction; Click-through rate prediction;
D O I
10.1007/s10115-023-01867-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Click-through rate (CTR) prediction plays a vital role in recommendation systems. Most models pay little attention to the relationship between target items in the user behavior sequence. The attention units used in these models cannot fully capture the context information, which can be used to reflect the variations of user interests. To address these problems, we propose a new model named interest extraction method based on multi-head attention mechanism (IEN) for CTR prediction. Specifically, we design an interest extraction module, which consists of two sub-modules: the item representation module (IRM) and the context-item interaction module (CIM). In IRM, we learn the relationship between target items in the user behavior sequence by a multi-head attention mechanism. Then, the user representation is gained by integrating the refined item representation and position information. At last, the correlation between the user and the target item is used to reflect user interests. In CIM, the context information has valuable temporal features which can reflect the variations of user interests. Therefore, user interests can be further acquired through the feature interaction between the context and the target item. After that, the learned relevance and the feature interaction are fed to the multi-layer perceptron (MLP) for prediction. Besides, experiments on four Amazon datasets were conducted to evaluate the effectiveness of our method in capturing user interests. The experimental results show that our proposed method outperforms state-of-the-art methods in terms of AUC and RI in the CTR prediction task.
引用
收藏
页码:3337 / 3352
页数:16
相关论文
共 50 条
  • [1] A new interest extraction method based on multi-head attention mechanism for CTR prediction
    Haifeng Yang
    Linjing Yao
    Jianghui Cai
    Yupeng Wang
    Xujun Zhao
    Knowledge and Information Systems, 2023, 65 : 3337 - 3352
  • [2] Network Configuration Entity Extraction Method Based on Transformer with Multi-Head Attention Mechanism
    Yang, Yang
    Qu, Zhenying
    Yan, Zefan
    Gao, Zhipeng
    Wang, Ti
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (01): : 735 - 757
  • [3] A Novel Trajectory Prediction Method Based on CNN, BiLSTM, and Multi-Head Attention Mechanism
    Xu, Yue
    Pan, Quan
    Wang, Zengfu
    Hu, Baoquan
    AEROSPACE, 2024, 11 (10)
  • [4] A cellular traffic prediction method based on diffusion convolutional GRU and multi-head attention mechanism
    Xiao, Junbi
    Cong, Yunhuan
    Zhang, Wenjing
    Weng, Wenchao
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [5] Ship pitch prediction method based on LSTMC and multi-head attention
    Wang, Yuchao
    Dou, Yanbing
    Yang, Zhouqi
    Fu, Huixuan
    OCEAN ENGINEERING, 2024, 309
  • [6] TMH: Two-Tower Multi-Head Attention neural network for CTR prediction
    An, Zijian
    Joe, Inwhee
    PLOS ONE, 2024, 19 (03):
  • [7] Trajectory Prediction in Complex Scenes Based on Multi-Head Attention Adversarial Mechanism
    Yu L.
    Li H.-Y.
    Jiao C.-L.
    Leng Y.-F.
    Xu G.-Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (06): : 1133 - 1146
  • [8] Remaining Useful Life Prediction of Aeroengines Based on Multi-Head Attention Mechanism
    Nie, Lei
    Xu, Shiyi
    Zhang, Lvfan
    Yin, Yehan
    Dong, Zhengqiong
    Zhou, Xiangdong
    MACHINES, 2022, 10 (07)
  • [9] A structured multi-head attention prediction method based on heterogeneous financial data
    Zhao, Cheng
    Li, Fangyong
    Peng, Zhe
    Zhou, Xiao
    Yan, Zhuge
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [10] Click-through rate prediction model integrating user interest and multi-head attention mechanism
    Zhang, Wei
    Han, Yahui
    Yi, Baolin
    Zhang, Zhaoli
    JOURNAL OF BIG DATA, 2023, 10 (01)