Prediction of the high-cost normalised discounted cumulative gain (nDCG) measure in information retrieval evaluation

被引:1
作者
Muwanei, Sinyinda [2 ]
Ravana, Sri Devi [1 ]
Hoo, Wai Lam [3 ]
Kunda, Douglas
机构
[1] Univ Malaya, Dept Informat Syst, Kuala Lumpur, Malaysia
[2] Univ Malaya, Kuala Lumpur, Malaysia
[3] Univ Malaya, Dept Informat Syst, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
关键词
JUDGMENTS;
D O I
10.47989/irpaper928
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Introduction. Information retrieval systems are vital to meeting daily information needs of users. The effectiveness of these systems has often been evaluated using the test collections approach, despite the high evaluation costs of this approach. Recent methods have been proposed that reduce evaluation costs through the prediction of information retrieval performance measures at the higher cut-off depths using other measures computed at the lower cut-off depths. The purpose of this paper is to propose two methods that addresses the challenge of accurately predicting the normalised discounted cumulative gain (nDCG) measure. Method. Data from selected test collections of the Text REtrieval Conference was used. The proposed methods employ the gradient boosting and linear regression models trained with topic scores of measures partitioned by TREC Tracks. Analysis. To evaluate the proposed methods, the coefficient of determination, Kendall's tau and Spearman correlations were used. Results. The proposed methods provide better predictions of the nDCG measure at the higher cut-off depths while using other measures computed at the lower cut-off depths. Conclusions. These proposed methods have shown improvement in the predictions of the nDCG measure while reducing the evaluation costs.
引用
收藏
页数:22
相关论文
共 38 条
[1]  
[Anonymous], 2004, P 13 TEXT RETRIEVAL
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
Aslam J. A., 2005, SIGIR 2005. Proceedings of the Twenty-Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P573, DOI 10.1145/1076034.1076134
[4]  
Aslam Javed A., 2007, P 16 ACM C CONFERENC, P633, DOI DOI 10.1145/1321440.1321529
[5]  
Berto A., 2013, ICTIR 13 P 2013 C TH, P30, DOI [10.1145/2499178.2499184, DOI 10.1145/2499178.2499184]
[6]  
Buckley C., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P25, DOI 10.1145/1008992.1009000
[7]  
Buttcher Stefan, 2007, 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P63, DOI 10.1145/1277741.1277755
[8]  
Carterette B., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P268, DOI 10.1145/1148170.1148219
[9]  
Carterette B., 2008, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, P651
[10]  
Chapelle Olivier, 2009, P 18 ACM C INFORM KN, P621, DOI DOI 10.1145/1645953.1646033