A personalized ranking method based on inverse reinforcement learning in search engines

被引：1

作者：

Karamiyan, Fatemeh ^{[1
]}

Mahootchi, Masoud ^{[1
]}

Mohebi, Azadeh ^{[2
]}

机构：

[1] Amirkabir Univ Technol, Dept Ind Engn & Management Syst, Tehran, Iran

[2] Iranian Res Inst Informat Sci & Technol IranDoc, Tehran, Iran

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 136卷

关键词：

Inverse reinforcement learning; Search engine; Ranking algorithm; Reward function; INFORMATION;

D O I：

10.1016/j.engappai.2024.108915

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a new, novel ranking method called Inverse-Reinforcement Learning Ranking. The main goal is to find a reward function representing the user's perceived utility after clicking on each result. It is necessary to utilize log information of all users' queries in the search engine dataset to reach this goal while assuming that the decisions (clicks) of the users are the best (optimal policy). The respective reward function is constructed using features extracted through a feature selection, and their corresponding weights are obtained by two optimization models, which are applicable for ranking results represented to the new users. In addition, new performance criteria were developed to illustrate the performance of the presented ranking method. To evaluate and test the proposed ranking algorithm, a real medium-sized dataset from a search engine is preprocessed and used in this research. Findings show promising results and decisive superiority over the default ranking method. It is illustrated that clicks on the top five results, top ten, and even the first results are remarkably improved by about 13-19% in all experiments, and the perplexity remarkably decreases by almost 23% after applying the ranking method.

引用

页数：19

共 51 条

[1] Agichtein Eugene, 2018, ACM SIGIR Forum, V52, P11, DOI 10.1145/3308774.3308778
[2] [Anonymous], 2012, P 5 ACM INT C WEB SE, DOI [10.1145/2124295, DOI 10.1145/2124295.2124336]
[3] A3CRank: An adaptive ranking method based on connectivity, content and click-through data
Bidoki, Ali Mohammad Zareh
Ghodsnia, Pedram
Yazdani, Nasser
Oroumchian, Farhad
[J]. INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 159 - 169
[4] A Click Sequence Model for Web Search
Borisov, Alexey
Wardenaar, Martijn
Markov, Ilya
de Rijke, Maarten
[J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 45 - 54
[5] Chirita PA, 2004, LECT NOTES COMPUT SC, V3137, P34
[6] Chuklin A., 2015, Synthesis Lectures on Information Concepts, Retrieval, and Services
[7] Derhami V., 2019, Journal of AI and Data Mining, V7, P421, DOI [10.22044/jadm.2019.3547.1814, DOI 10.22044/JADM.2019.3547.1814]
[8] Applying reinforcement learning for web pages ranking algorithms
Derhami, Vali
Khodadadian, Elahe
Ghasemzadeh, Mohammad
Bidoki, Ali Mohammad Zareh
[J]. APPLIED SOFT COMPUTING, 2013, 13 (04) : 1686 - 1692
[9] Dupret G., 2010, P 3 ACM INT C WEB SE, P181
[10] Dupret Georges E., 2008, P 31 ANN INT ACM SIG, P331, DOI DOI 10.1145/1390334.1390392

← 1 2 3 4 5 6 →