A personalized ranking method based on inverse reinforcement learning in search engines

被引:1
作者
Karamiyan, Fatemeh [1 ]
Mahootchi, Masoud [1 ]
Mohebi, Azadeh [2 ]
机构
[1] Amirkabir Univ Technol, Dept Ind Engn & Management Syst, Tehran, Iran
[2] Iranian Res Inst Informat Sci & Technol IranDoc, Tehran, Iran
关键词
Inverse reinforcement learning; Search engine; Ranking algorithm; Reward function; INFORMATION;
D O I
10.1016/j.engappai.2024.108915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a new, novel ranking method called Inverse-Reinforcement Learning Ranking. The main goal is to find a reward function representing the user's perceived utility after clicking on each result. It is necessary to utilize log information of all users' queries in the search engine dataset to reach this goal while assuming that the decisions (clicks) of the users are the best (optimal policy). The respective reward function is constructed using features extracted through a feature selection, and their corresponding weights are obtained by two optimization models, which are applicable for ranking results represented to the new users. In addition, new performance criteria were developed to illustrate the performance of the presented ranking method. To evaluate and test the proposed ranking algorithm, a real medium-sized dataset from a search engine is preprocessed and used in this research. Findings show promising results and decisive superiority over the default ranking method. It is illustrated that clicks on the top five results, top ten, and even the first results are remarkably improved by about 13-19% in all experiments, and the perplexity remarkably decreases by almost 23% after applying the ranking method.
引用
收藏
页数:19
相关论文
共 51 条
  • [1] Agichtein Eugene, 2018, ACM SIGIR Forum, V52, P11, DOI 10.1145/3308774.3308778
  • [2] [Anonymous], 2012, P 5 ACM INT C WEB SE, DOI [10.1145/2124295, DOI 10.1145/2124295.2124336]
  • [3] A3CRank: An adaptive ranking method based on connectivity, content and click-through data
    Bidoki, Ali Mohammad Zareh
    Ghodsnia, Pedram
    Yazdani, Nasser
    Oroumchian, Farhad
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 159 - 169
  • [4] A Click Sequence Model for Web Search
    Borisov, Alexey
    Wardenaar, Martijn
    Markov, Ilya
    de Rijke, Maarten
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 45 - 54
  • [5] Chirita PA, 2004, LECT NOTES COMPUT SC, V3137, P34
  • [6] Chuklin A., 2015, Synthesis Lectures on Information Concepts, Retrieval, and Services
  • [7] Derhami V., 2019, Journal of AI and Data Mining, V7, P421, DOI [10.22044/jadm.2019.3547.1814, DOI 10.22044/JADM.2019.3547.1814]
  • [8] Applying reinforcement learning for web pages ranking algorithms
    Derhami, Vali
    Khodadadian, Elahe
    Ghasemzadeh, Mohammad
    Bidoki, Ali Mohammad Zareh
    [J]. APPLIED SOFT COMPUTING, 2013, 13 (04) : 1686 - 1692
  • [9] Dupret G., 2010, P 3 ACM INT C WEB SE, P181
  • [10] Dupret Georges E., 2008, P 31 ANN INT ACM SIG, P331, DOI DOI 10.1145/1390334.1390392