Combination of document priors in Web Information Retrieval

被引:0
|
作者
Peng, Jie [1 ]
Ounis, Iadh [1 ]
机构
[1] Univ Glasgow, Dept Comp Sci, Glasgow G12 8QQ, Lanark, Scotland
来源
ADVANCES IN INFORMATION RETRIEVAL | 2007年 / 4425卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Query independent features (also called document priors), such as the number of incoming links to a document, its PageRank, or the length of its associated URL, have been explored to boost the retrieval effectiveness of Web Information Retrieval (IR) systems. The combination of such query independent features could further enhance the retrieval performance. However, most current combination approaches are based on heuristics, which ignore the possible dependence between the document priors. In this paper, we present a novel and robust method for combining document priors in a principled way. We use a conditional probability rule, which is derived from Kolmogorov's axioms. In particular, we investigate the retrieval performance attainable by our combination of priors method, in comparison to the use of single priors and a heuristic prior combination method. Furthermore, we examine when and how document priors should be combined.
引用
收藏
页码:732 / +
页数:2
相关论文
共 50 条
  • [1] Intelligent Interface for Web Information Retrieval with Document Understanding
    Khokale, Rahul S.
    Atique, Mohammad
    HUMAN-COMPUTER INTERACTION: APPLICATIONS AND SERVICES, PT III, 2014, 8512 : 21 - 31
  • [2] Evolutionary learning of Web-document structure for information retrieval
    Kim, S
    Zhang, BT
    PROCEEDINGS OF THE 2001 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2001, : 1253 - 1260
  • [3] Web document indexing and retrieval
    Hyusein, B
    Patel, A
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 573 - 579
  • [4] Contextualisation of information retrieval process and document ranking task in web search tools
    Bouramoul, Abdelkrim
    INTERNATIONAL JOURNAL OF SPACE-BASED AND SITUATED COMPUTING, 2016, 6 (02) : 74 - 89
  • [5] Information retrieval and the virtual document
    Watters, C
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (11): : 1028 - 1029
  • [6] Information retrieval on the Web
    Yang, KD
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2005, 39 : 33 - 80
  • [7] Persian Web Document Retrieval Corpus
    Zinvandi, Erfan
    Alikhani, Morteza
    Pourbahman, Zahra
    Kazemi, Reza
    Amini, Arash
    2024 12TH IRAN WORKSHOP ON COMMUNICATION AND INFORMATION THEORY, IWCIT, 2024,
  • [8] Information retrieval on the Web
    Kobayashi, M
    Takeda, K
    ACM COMPUTING SURVEYS, 2000, 32 (02) : 144 - 173
  • [9] Design of a Metacrawler for Web Document Retrieval
    Babu, K. R. Remesh
    Arya, A. P.
    2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 478 - 484
  • [10] A set of novel HTML']HTML document quality features for Web information retrieval: Including applications to learning to rank for information retrieval
    Aydin, Ahmet
    Arslan, Ahmet
    Dincer, Bekir Taner
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246