Applying reinforcement learning for web pages ranking algorithms

被引:29
作者
Derhami, Vali [1 ]
Khodadadian, Elahe [1 ]
Ghasemzadeh, Mohammad [1 ]
Bidoki, Ali Mohammad Zareh [1 ]
机构
[1] Yazd Univ, Elect & Comp Engn Dept, Yazd, Iran
基金
美国国家科学基金会;
关键词
Ranking; Search engine; Reinforcement Learning; Artificial intelligence; Value function; Agent;
D O I
10.1016/j.asoc.2012.12.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ranking web pages for presenting the most relevant web pages to user's queries is one of the main issues in any search engine. In this paper, two new ranking algorithms are offered, using Reinforcement Learning (RL) concepts. RL is a powerful technique of modern artificial intelligence that tunes agent's parameters, interactively. In the first step, with formulation of ranking as an RL problem, a new connectivity-based ranking algorithm, called RL Rank, is proposed. In RL Rank, agent is considered as a surfer who travels between web pages by clicking randomly on a link in the current page. Each web page is considered as a state and value function of state is used to determine the score of that state (page). Reward is corresponded to number of out links from the current page. Rank scores in RL Rank are computed in a recursive way. Convergence of these scores is proved. In the next step, we introduce a new hybrid approach using combination of BM25 as a content-based algorithm and RL Rank. Both proposed algorithms are evaluated by well known benchmark datasets and analyzed according to concerning criteria. Experimental results show using RL concepts leads significant improvements in raking algorithms. (C) 2013 Elsevier B. V. All rights reserved.
引用
收藏
页码:1686 / 1692
页数:7
相关论文
共 28 条
[1]  
[Anonymous], 1971, The SMART Retrieval System-Experiments in Automatic Document Processing
[2]  
[Anonymous], 2004, COLING 2004 P 20 INT
[3]  
[Anonymous], 2005, P 14 INT C WORLD WID, DOI 10.1145/1060745.1060827
[4]  
[Anonymous], 1998, P 7 INT WORLD WID WE
[5]  
[Anonymous], P SIGIR 2007 WORKSH
[6]  
[Anonymous], 1999, TECHNICAL REPORT
[7]  
Berger Henengouwen S., 1998, ENG NUMERICAL ANAL
[8]   DistanceRank: An intelligent ranking algorithm for web pages [J].
Bidoki, Ali Mohammad Zareh ;
Yazdani, Nasser .
INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (02) :877-892
[9]   A3CRank: An adaptive ranking method based on connectivity, content and click-through data [J].
Bidoki, Ali Mohammad Zareh ;
Ghodsnia, Pedram ;
Yazdani, Nasser ;
Oroumchian, Farhad .
INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) :159-169
[10]  
Borda J.C.de., 1781, HIST ACAD ROYAL DES