Personalized web-document filtering using reinforcement learning

被引:22
作者
Zhang, BT [1 ]
Seo, YW [1 ]
机构
[1] Seoul Natl Univ, Sch Comp Sci & Engn, Biointelligence Lab, Seoul 151742, South Korea
关键词
D O I
10.1080/088395101750363993
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document filtering is increasingly deployed in Web environments to reduce information overload of users. We formulate online information filtering as a reinforcement learning problem, i.e., TD(0). The goal is to learn user profiles that best represent information needs and thus maximize the expected value of user relevance feedback. A method is then presented that acquires reinforcement signals automatically by estimating user's implicit feedback from direct observations of browsing behaviors. This "learning by observation'' approach is contrasted with conventional relevance feedback methods which require explicit user feedbacks. Field tests have been performed that involved 10 users reading a total of 18,750 HTML documents during 45 days. Compared to the existing document filtering techniques, the proposed learning method showed superior performance in information quality and adaptation speed to user preferences in online filtering.
引用
收藏
页码:665 / 685
页数:21
相关论文
共 23 条
[1]  
[Anonymous], 1994, P 17 ANN INT ACM SIG
[2]   INFORMATION FILTERING AND INFORMATION-RETRIEVAL - 2 SIDES OF THE SAME COIN [J].
BELKIN, NJ ;
CROFT, WB .
COMMUNICATIONS OF THE ACM, 1992, 35 (12) :29-38
[3]  
BOYAN J, 1996, P AAAI WORKSH INT BA, P334
[4]  
Callan J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P224, DOI 10.1145/290941.290998
[5]  
Falk A., 1996, PAAM 96. Proceedings of the First International Conference on the Practical Application of Intelligent Agents and Multi-Agent Technology, P169
[6]  
FRAKES WB, 1992, INFORMATION RETRIEVA, P131
[7]  
HIRASHIMA T, 1998, P INT C INT US INT, P119
[8]  
JOACHIMS T, 1997, P 15 INT JOINT C ART, P770
[9]   ANATAGONOMY: A personalized newspaper on the World Wide Web [J].
Kamba, T ;
Sakagami, H ;
Koseki, Y .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1997, 46 (06) :789-803
[10]  
KINDO T, 1997, P 15 INT JOINT C ART, P716