Knowledge-Enhanced Latent Semantic Indexing

被引:0
|
作者
David Guo
Michael W. Berry
Bryan B. Thompson
Sidney Bailin
机构
[1] University of Tennessee,Department of Computer Science
[2] Global Wisdom,undefined
[3] Inc.,undefined
[4] Knowledge Evolution,undefined
[5] Inc.,undefined
来源
Information Retrieval | 2003年 / 6卷
关键词
latent semantic indexing; MeSH; metathesaurus; OHSUMED; semantic network; UMLS;
D O I
暂无
中图分类号
学科分类号
摘要
Latent Semantic Indexing (LSI) is a popular information retrieval model for concept-based searching. As with many vector space IR models, LSI requires an existing term-document association structure such as a term-by-document matrix. The term-by-document matrix, constructed during document parsing, can only capture weighted vocabulary occurrence patterns in the documents. However, for many knowledge domains there are pre-existing semantic structures that could be used to organize and categorize information. The goals of this study are (i) to demonstrate how such semantic structures can be automatically incorporated into the LSI vector space model, and (ii) to measure the effect of these structures on query matching performance. The new approach, referred to as Knowledge-Enhanced LSI, is applied to documents in the OHSUMED medical abstracts collection using the semantic structures provided by the UMLS Semantic Network and MeSH. Results based on precision-recall data (11-point average precision values) indicate that a MeSH-enhanced search index is capable of delivering noticeable incremental performance gain (as much as 35%) over the original LSI for modest constraints on precision. This performance gain is achieved by replacing the original query with the MeSH heading extracted from the query text via regular expression matches.
引用
收藏
页码:225 / 250
页数:25
相关论文
共 50 条
  • [1] Knowledge-enhanced latent semantic indexing
    Guo, D
    Berry, MW
    Thompson, BB
    Bailin, S
    INFORMATION RETRIEVAL, 2003, 6 (02): : 225 - 250
  • [2] Knowledge-enhanced semantic communication system with OFDM transmissions
    Xu, Xiaodong
    Xiong, Huachao
    Wang, Yining
    Che, Yue
    Han, Shujun
    Wang, Bizhu
    Zhang, Ping
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (07)
  • [3] Knowledge-enhanced semantic communication system with OFDM transmissions
    Xiaodong XU
    Huachao XIONG
    Yining WANG
    Yue CHE
    Shujun HAN
    Bizhu WANG
    Ping ZHANG
    ScienceChina(InformationSciences), 2023, 66 (07) : 266 - 282
  • [4] Chinese Relation Extraction with External Knowledge-Enhanced Semantic Understanding
    Lv, Shulin
    Ding, Xiaoyao
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (02) : 1317 - 1324
  • [5] Modeling and diagnosing domain knowledge using Latent Semantic Indexing
    Freeman, JT
    Thompson, BB
    Cohen, MS
    PROCEEDINGS OF THE HUMAN FACTORS AND ERGONOMICS SOCIETY 43RD ANNUAL MEETING, VOLS 1 AND 2, 1999, : 233 - 236
  • [6] Sprinkled Latent Semantic Indexing for Text Classification with Background Knowledge
    Yang, Haiqin
    King, Irwin
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 53 - 60
  • [7] Enhanced approach for latent semantic indexing using wavelet transform
    Jaber, T.
    Amira, A.
    Milligan, P.
    IET IMAGE PROCESSING, 2012, 6 (09) : 1236 - 1245
  • [8] Probabilistic latent semantic indexing
    Hofmann, T
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 50 - 57
  • [9] Regularized Latent Semantic Indexing
    Wang, Quan
    Xu, Jun
    Li, Hang
    Craswell, Nick
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 685 - 694
  • [10] INDEXING BY LATENT SEMANTIC ANALYSIS
    DEERWESTER, S
    DUMAIS, ST
    FURNAS, GW
    LANDAUER, TK
    HARSHMAN, R
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1990, 41 (06): : 391 - 407