Combination of document priors in Web Information Retrieval

被引:0
|
作者
Peng, Jie [1 ]
Ounis, Iadh [1 ]
机构
[1] Univ Glasgow, Dept Comp Sci, Glasgow G12 8QQ, Lanark, Scotland
来源
ADVANCES IN INFORMATION RETRIEVAL | 2007年 / 4425卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Query independent features (also called document priors), such as the number of incoming links to a document, its PageRank, or the length of its associated URL, have been explored to boost the retrieval effectiveness of Web Information Retrieval (IR) systems. The combination of such query independent features could further enhance the retrieval performance. However, most current combination approaches are based on heuristics, which ignore the possible dependence between the document priors. In this paper, we present a novel and robust method for combining document priors in a principled way. We use a conditional probability rule, which is derived from Kolmogorov's axioms. In particular, we investigate the retrieval performance attainable by our combination of priors method, in comparison to the use of single priors and a heuristic prior combination method. Furthermore, we examine when and how document priors should be combined.
引用
收藏
页码:732 / +
页数:2
相关论文
共 50 条
  • [31] Combination methods for crosslingual Web retrieval
    Kamps, Jaap
    de Rijke, Maarten
    Sigurbjornsson, Borkur
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 856 - 864
  • [32] BEYOND DOCUMENT-RETRIEVAL TOWARD INFORMATION-RETRIEVAL
    ALLEN, LE
    MINNESOTA LAW REVIEW, 1963, 47 (05) : 713 - 767
  • [33] Manual and automated document retrieval at the NSF Web site
    Wink, DJ
    JOURNAL OF CHEMICAL EDUCATION, 1998, 75 (05) : 535 - 535
  • [34] Document Creation with Information Retrieval System Support
    Betina, Timotej
    Polasek, Ivan
    14TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2013, : 147 - 151
  • [35] Extractive spoken document summarization for information retrieval
    Chen, Berlin
    Chen, Yi-Ting
    PATTERN RECOGNITION LETTERS, 2008, 29 (04) : 426 - 437
  • [36] Rank by Readability: Document Weighting for Information Retrieval
    Newbold, Neil
    McLaughlin, Harry
    Gillam, Lee
    ADVANCES IN MULTIDISCIPLINARY RETRIEVAL, 2010, 6107 : 20 - 30
  • [37] Using document dimensions for enhanced information retrieval
    Jayasooriya, T
    Manandhar, S
    APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 145 - 152
  • [38] Job information retrieval based on document similarity
    Wang, Jingfan
    Xia, Yunqing
    Zheng, Thomas Fang
    Wu, Xiaojun
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 165 - +
  • [39] A "stereo" document representation for textual information retrieval
    Chen, L
    Zeng, J
    Tokuda, N
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (06): : 768 - 774
  • [40] Modeling Document Scores for Distributed Information Retrieval
    Markov, Ilya
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1321 - 1321