RI for IR: Capturing Term Contexts Using Random Indexing for Comprehensive Information Retrieval

Authors
Prasath, Rajendra [1, 2]
Sarkar, Sudeshna [1]
O'Reilly, Philip [2]
Affiliations
[1] Indian Inst Technol, Dept Comp Sci & Engn, Kharagpur 721302, W Bengal, India
[2] Univ Coll, Dept Business Informat Syst, Cork, Ireland
Source
HUMAN-INSPIRED COMPUTING AND ITS APPLICATIONS, PT I | 2014 / Vol. 8856
Keywords
Random Indexing; Implicit Semantic Analysis; Topic Dynamics; Retrieval Effectiveness; Cross-Lingual Information Retrieval; Expansion
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present an approach, based on random indexing, to identify semantically related information that effectively disambiguates the user query and improves the retrieval efficiency of news documents. User query terms are expanded with terms that have similar word senses, which are discovered by implicitly considering the "associatedness" of the document context with that of the given query. This type of associatedness is guided by word space models, as described by Kanerva et al. (2000). The word space model computes the meaning of terms by implicitly utilizing the distributional patterns (contexts) of words collected over large text data. The distributional patterns represent semantic similarity between words in terms of their spatial proximity in the context space. In this space, words are represented by context vectors whose relative directions are assumed to indicate semantic similarity. Under the distributional hypothesis, words with similar meanings are assumed to have similar contexts. For example, if we observe two words that consistently occur in the same contexts, we are justified in assuming that they mean similar things. Hence the word space methodology makes semantics computable, and the underlying models do not require any linguistic or semantic expertise. Experiments on the FIRE news collection show that the proposed approach effectively captures term contexts using higher-order term associations across the collection of news documents and uses such information to assist the retrieval of documents.
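The abstract does not give implementation details, so the following is only a minimal sketch of the general random indexing idea it refers to, not the authors' method. Every name and parameter value here (`DIM`, `NONZERO`, `WINDOW`, `index_vector`, `build_context_vectors`, `expand_query`) is an assumption made for illustration. Each term receives a sparse random index vector, a term's context vector is the sum of the index vectors of its neighbours, and query terms are expanded with the nearest terms by cosine similarity.

```python
import math
import random
from collections import defaultdict

DIM = 300      # dimensionality of the word space (assumed value)
NONZERO = 10   # number of +1/-1 entries per index vector (assumed value)
WINDOW = 2     # context window on each side of a term (assumed value)


def index_vector(term, dim=DIM, nonzero=NONZERO):
    """Sparse ternary index vector for a term, stored as {position: +1 or -1}."""
    rng = random.Random(term)  # seeding by the term makes the vector reproducible
    positions = rng.sample(range(dim), nonzero)
    return {p: (1 if i % 2 == 0 else -1) for i, p in enumerate(positions)}


def build_context_vectors(sentences, window=WINDOW):
    """Sum the index vectors of neighbouring terms into each term's context vector."""
    context = defaultdict(lambda: defaultdict(int))
    for tokens in sentences:
        for i, term in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    for pos, sign in index_vector(tokens[j]).items():
                        context[term][pos] += sign
    return context


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(val * v.get(pos, 0) for pos, val in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def expand_query(query_terms, context, k=2):
    """Append up to k nearest terms (by context similarity) for each query term."""
    expanded = list(query_terms)
    for term in query_terms:
        if term not in context:
            continue  # unseen terms cannot be expanded
        scored = sorted(
            ((cosine(context[term], vec), other)
             for other, vec in context.items() if other != term),
            reverse=True,
        )
        expanded += [other for _, other in scored[:k] if other not in expanded]
    return expanded


# Toy corpus (made up for illustration): "loan" and "credit" never co-occur,
# yet they share the same neighbours, so their context vectors point the same way.
toy_corpus = [
    ["bank", "approves", "loan"],
    ["bank", "approves", "credit"],
    ["river", "floods", "village"],
]
toy_context = build_context_vectors(toy_corpus)
print(expand_query(["loan"], toy_context, k=1))
```

In this toy setup the relation between "loan" and "credit" is found only through their shared neighbours, which is one simple reading of the higher-order term associations mentioned in the abstract. A real system would also need tokenization, term weighting, and a much larger collection.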
Pages: 104 - 112
Page count: 9