Wikipedia-based query phrase expansion in patent class search

被引:23
|
作者
Al-Shboul, Bashar [1 ]
Myaeng, Sung-Hyon [2 ,3 ]
机构
[1] Univ Jordan, Dept Business Informat Technol, Amman 11942, Jordan
[2] Korea Adv Inst Sci & Technol, Dept Comp Sci, Taejon 305701, South Korea
[3] Korea Adv Inst Sci & Technol, Div Web Sci, Taejon 305701, South Korea
来源
INFORMATION RETRIEVAL | 2014年 / 17卷 / 5-6期
基金
新加坡国家研究基金会;
关键词
Patent search; Phrase-based query expansion; Wikipedia categories; Clarity; Retrievability; INFORMATION-RETRIEVAL; LEXICAL COHESION; TERMS;
D O I
10.1007/s10791-013-9233-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relevance feedback methods generally suffer from topic drift caused by word ambiguities and synonymous uses of words. Topic drift is an important issue in patent information retrieval as people tend to use different expressions describing similar concepts causing low precision and recall at the same time. Furthermore, failing to retrieve relevant patents to an application during the examination process may cause legal problems caused by granting an existing invention. A possible cause of topic drift is utilizing a relevance feedback-based search method. As a way to alleviate the inherent problem, we propose a novel query phrase expansion approach utilizing semantic annotations in Wikipedia pages, trying to enrich queries with phrases disambiguating the original query words. The idea was implemented for patent search where patents are classified into a hierarchy of categories, and the analyses of the experimental results showed not only the positive roles of phrases and words in retrieving additional relevant documents through query expansion but also their contributions to alleviating the query drift problem. More specifically, our query expansion method was compared against relevance-based language model, a state-of-the-art query expansion method, to show its superiority in terms of MAP on all levels of the classification hierarchy.
引用
收藏
页码:430 / 451
页数:22
相关论文
共 50 条
  • [31] English Nominal Compound Detection with Wikipedia-Based Methods
    Nagy, Istvan T.
    Vincze, Veronika
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 225 - 232
  • [32] Wikipedia-Based Extraction of Key Information from Resumes
    Ghufran, Mohammad
    Bennacer, Nacera
    Quercini, Gianluca
    2017 11TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2017, : 135 - 145
  • [33] Wikipedia-based cross-language text classification
    Mourino Garcia, Marcos Antonio
    Perez Rodriguez, Roberto
    Anido Rifon, Luis
    INFORMATION SCIENCES, 2017, 406 : 12 - 28
  • [34] Wikipedia-based Named Entity Recognition System for Turkish
    Kucuk, Dogan
    Arici, Nursal
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2016, 19 (03): : 325 - 332
  • [35] Tools and strategies: Incorporating Wikipedia-based assignments into a course
    Salvaggio, Eryk
    Mathewson, Jami
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [36] Interactive Query Expansion using Concept-Based Directions Finder Based on Wikipedia
    Meiyappan, Yuvarani
    Iyengar, Sriman Narayana
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (06) : 571 - 578
  • [37] Wikipedia-based approach for linking ontology concepts to their realisations in text
    Paci, Giulio
    Pedrazzi, Giorgio
    Turra, Roberta
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3861 - 3866
  • [38] A new approach for query expansion using Wikipedia and WordNet
    Azad, Hiteshwar Kumar
    Deepak, Akshay
    INFORMATION SCIENCES, 2019, 492 : 147 - 163
  • [39] A PATENT RETRIEVAL QUERY EXPANSION METHOD BASED ON SEMANTIC DICTIONARY
    Xu, Kan
    Feng, Jiaojiao
    Wang, Kaiqiao
    Lin, Hongfei
    Lin, Yuan
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2019, 20 (06) : 1233 - 1240
  • [40] A Semantic Query Expansion-based Patent Retrieval Approach
    Wang, Feng
    Lin, Lanfen
    Yang, Shuai
    Zhu, Xiaowei
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 572 - 577