Query expansion by mining user logs

被引:133
作者
Cui, H [1 ]
Wen, JR
Nie, JY
Ma, WY
机构
[1] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117543, Singapore
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Univ Montreal, Dept Informat & Rech Operat, CP 6128,Succursale Ctr Ville, Montreal, PQ H3C 3J7, Canada
关键词
query expansion; user log; probabilistic model; information retrieval; search engine;
D O I
10.1109/TKDE.2003.1209002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Queries to search engines on the Web are usually short. They do not provide sufficient information for an effective selection of relevant documents. Previous research has proposed the utilization of query expansion to deal with this problem. However, expansion terms are usually determined on term co-occurrences within documents. In this study, we propose a new method for query expansion based on user interactions recorded in user logs. The central idea is to extract correlations between query terms and document terms by analyzing user logs. These correlations are then used to select high-quality expansion terms for new queries. Compared to previous query expansion methods, ours takes advantage of the user judgments implied in user logs. The experimental results show that the log-based query expansion method can produce much better results than both the classical search method and the other query expansion methods.
引用
收藏
页码:829 / 839
页数:11
相关论文
共 41 条
  • [1] [Anonymous], P 16 ANN INT ACM SIG
  • [2] [Anonymous], INT J LEXICOGRAPHY
  • [3] [Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
  • [4] BAEZAYATES RA, 1999, MODERN INFORMATION R
  • [5] BATES MJ, 1981, ANNU REV INFORM SCI, V16, P139
  • [6] Beeferman D., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P407, DOI 10.1145/347090.347176
  • [7] BRAJNIK G, 1996, P 19 ANN INT ACM SIG, P128
  • [8] Buckley C, 1992, P 1 TEXT RETR C TREC, P59
  • [9] BUCKLEY C, 1998, P 6 TEXT RETR C TREC, P107
  • [10] BUCKLEY GJ, 1994, TECHNOL DISABIL, V3, P69