Incorporating Non-sequential Behavior into Click Models

被引:32
作者
Wang, Chao [1 ]
Liu, Yiqun [1 ]
Wang, Meng [2 ]
Zhou, Ke [3 ]
Nie, Jian-yun [4 ]
Ma, Shaoping [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing, Peoples R China
[2] HeFei Univ Technol, Sch Comp & Informat, Hefei, Peoples R China
[3] Yahoo Labs, London, England
[4] Univ Montreal, Montreal, PQ H3C 3J7, Canada
来源
SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2015年
关键词
click model; non-sequential behavior; eye-tracking;
D O I
10.1145/2766462.2767712
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Click-through information is considered as a valuable source of users' implicit relevance feedback. As user behavior is usually influenced by a number of factors such as position, presentation style and site reputation, researchers have proposed a variety of assumptions (i.e. click models) to generate a reasonable estimation of result relevance. The construction of click models usually follow some hypotheses. For example, most existing click models follow the sequential examination hypothesis in which users examine results from top to bottom in a linear fashion. While these click models have been successful, many recent studies showed that there is a large proportion of non-sequential browsing (both examination and click) behaviors in Web search, which the previous models fail to cope with. In this paper, we investigate the problem of properly incorporating non-sequential behavior into click models. We firstly carry out a laboratory eye-tracking study to analyze user's non-sequential examination behavior and then propose a novel click model named Partially Sequential Click Model (PSCM) that captures the practical behavior of users. We compare PSCM with a number of existing click models using two real-world search engine logs. Experimental results show that PSCM outperforms other click models in terms of both predicting click behavior (perplexity) and estimating result relevance (NDCG and user preference test). We also publicize the implementations of PSCM and related datasets for possible future comparison studies.
引用
收藏
页码:283 / 292
页数:10
相关论文
共 32 条
[1]  
Agichtein E., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P3, DOI 10.1145/1148170.1148175
[2]  
[Anonymous], 2011, THEORY USE EM ALGORI
[3]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI [10.1145/1341531, DOI 10.1145/1341531.1341545]
[4]  
Broder A., 2002, SIGIR Forum, V36, P3, DOI 10.1145/792550.792552
[5]  
Buscher Georg., 2012, WSDM, P373, DOI [10.1145/2124295.2124341, DOI 10.1145/2124295.2124341]
[6]  
Chapelle Olivier, 2009, ACM WWW 09
[7]  
Chen Danqi, 2012, P 5 ACM INT C WEB SE, P463, DOI DOI 10.1145/2124295.2124351
[8]   Revisiting the evaluation of diversified search evaluation metrics with user preferences [J].
Chen, Fei ;
Liu, Yiqun ;
Dou, Zhicheng ;
Xu, Keyang ;
Cao, Yujie ;
Zhang, Min ;
Ma, Shaoping .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8870 :48-59
[9]  
Chuklin A, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P493
[10]  
Cutrell E, 2007, CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1 AND 2, P407