DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL

Citations: 0
Authors
Wang, Shuguang [1]
Visweswaran, Shyam [2]
Hauskrecht, Milos [3]
Affiliations
[1] Univ Pittsburgh, Intelligent Syst Program, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
Source
KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL | 2009
Keywords
Information retrieval; Link analysis; Domain knowledge; Biomedical documents; Probabilistic model; EXPANSION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We are interested in enhancing information retrieval methods by incorporating domain knowledge. In this paper, we present a new document retrieval framework that learns a probabilistic knowledge model and exploits this model to improve document retrieval. The knowledge model is represented by a network of associations among concepts defining key domain entities and is extracted from a corpus of documents or from a curated domain knowledge base. This knowledge model is then used to perform concept-related probabilistic inferences using link analysis methods and is applied to the task of document retrieval. We evaluate this new framework on two biomedical datasets and show that this novel knowledge-based approach outperforms the state-of-the-art Lemur/Indri document retrieval method.
Pages: 26 / +
Page count: 2