FiBiNET: Combining Feature Importance and Bilinear Feature Interaction for Click-Through Rate Prediction

被引:157
作者
Huang, Tongwen [1 ]
Zhang, Zhiqi [1 ]
Zhang, Junlin [1 ]
机构
[1] Sina Weibo Inc, Beijing, Peoples R China
来源
RECSYS 2019: 13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS | 2019年
关键词
Display Advertising; CTR Prediction; Factorization Machines; Squeeze Excitation network; Neural Network; Bilinear Function;
D O I
10.1145/3298689.3347043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advertising and feed ranking are essential to many Internet companies such as Facebook and Sina Weibo. Among many real-world advertising and feed ranking systems, click through rate (CTR) prediction plays a central role. There are many proposed models in this field such as logistic regression, tree based models, factorization machine based models and deep learning based CTR models. However, many current works calculate the feature interactions in a simple way such as Hadamard product and inner product and they care less about the importance of features. In this paper, a new model named FiBiNET as an abbreviation for Feature Importance and Bilinear feature Interaction NETwork is proposed to dynamically learn the feature importance and fine-grained feature interactions. On the one hand, the FiBiNET can dynamically learn the importance of features via the Squeeze-Excitation network (SENET) mechanism; on the other hand, it is able to effectively learn the feature interactions via bilinear function. We conduct extensive experiments on two real world datasets and show that our shallow model outperforms other shallow models such as factorization machine(FM) and field-aware factorization machine(FFM). In order to improve performance further, we combine a classical deep neural network(DNN) component with the shallow model to be a deep model. The deep FiBiNET consistently outperforms the other state-of-the-art deep models such as DeepFM and extreme deep factorization machine(XdeepFM).
引用
收藏
页码:169 / 177
页数:9
相关论文
共 26 条
  • [1] [Anonymous], 2017, ARXIV170304247
  • [2] [Anonymous], 2017, ARXIV170804617
  • [3] [Anonymous], 2010, WEB SCALE BAYESIAN C
  • [4] [Anonymous], 2016, P 1 WORKSH DEEP LEAR, DOI DOI 10.1145/2988450.2988454
  • [5] Cho Kyunghyun, 2014, C EMPIRICAL METHODS, P1724
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] Impact of Highway Investment on the Economy and Employment Across US Industrial Sectors Simultaneous Equations Analysis at the Metropolitan Level
    He, Xiang
    Kastrouni, Eirini
    Zhang, Lei
    [J]. TRANSPORTATION RESEARCH RECORD, 2014, (2452) : 1 - 10
  • [8] Neural Factorization Machines for Sparse Predictive Analytics
    He, Xiangnan
    Chua, Tat-Seng
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 355 - 364
  • [9] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [10] Field-aware Factorization Machines for CTR Prediction
    Juan, Yuchin
    Zhuang, Yong
    Chin, Wei-Sheng
    Lin, Chih-Jen
    [J]. PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, : 43 - 50