RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction

被引：8

作者：

Zhao, Pu ^{[1
]}

Luo, Chuan ^{[1
]}

Zhou, Cheng ^{[1
]}

Qiao, Bo ^{[1
]}

He, Jiale ^{[1
]}

Zhang, Liangjie ^{[1
]}

Lin, Qingwei ^{[1
]}

机构：

[1] Microsoft, Beijing, Peoples R China

来源：

SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2021年

关键词：

CTR Prediction; Reinforcement Learning; Noise Filtering;

D O I：

10.1145/3404835.3463012

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Click-through rate (CTR) prediction aims to recall the advertisements that users are interested in and to lead users to click, which is of critical importance for a variety of online advertising systems. In practice, CTR prediction is generally formulated as a conventional binary classification problem, where the clicked advertisements are positive samples and the others are negative samples. However, directly treating unclicked advertisements as negative samples would suffer from the severe label noise issue, since there exist many reasons why users are interested in a few advertisements but do not click. To address such serious issue, we propose a reinforcement learning based noise filtering approach, dubbed RLNF, which employs a noise filter to select effective negative samples. In RLNF, such selected, effective negative samples can be used to enhance the CTR prediction model, and meanwhile the effectiveness of the noise filter can be enhanced through reinforcement learning using the performance of CTR prediction model as reward. Actually, by alternating the enhancements of the noise filter and the CTR prediction model, the performance of both the noise filter and the CTR prediction model is improved. In our experiments, we equip 7 state-of-the-art CTR prediction models with RLNF. Extensive experiments on a public dataset and an industrial dataset present that RLNF significantly improves the performance of all these 7 CTR prediction models, which indicates both the effectiveness and the generality of RLNF.

引用

页码：2268 / 2272

页数：5

共 41 条

[1] CTRec: A Long-Short Demands Evolution Model for Continuous-Time Recommendation [J].

Bai, Ting ;

Zou, Lixin ;

Zhao, Wayne Xin ;

Du, Pan ;

Liu, Weidong ;

Nie, Jian-Yun ;

Wen, Ji-Rong .

PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :675-684

[2] Click Through Rate Prediction for Local Search Results [J].

Cacheda, Fidel ;

Barbieri, Nicola ;

Blanco, Roi .

WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, :171-180

[3]

Chan PPK, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2007

[4]

Cheng H., 2012, P INT WORKSHOP DATA, P9

[5]

Cheng Heng-Tze, 2016, P 1 WORKSHOP DEEP LE, P7

[6] Deep Neural Networks for YouTube Recommendations [J].

Covington, Paul ;

Adams, Jay ;

Sargin, Emre .

PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, :191-198

[7]

Deng Wei, P WSDM 2021, P922

[8]

Deng Y, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1589

[9] Deep Character-Level Click-Through Rate Prediction for Sponsored Search [J].

Edizel, Bora ;

Mantrach, Amin ;

Bai, Xiao .

SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :305-314

[10]

Feng YF, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2301

← 1 2 3 4 5 →