Improving large-scale search engines with semantic annotations

Cited by: 4
Authors
Fuentes-Lorenzo, Damaris [1]
Fernandez, Norberto [1]
Fisteus, Jesus A. [1]
Sanchez, Luis [1]
Affiliations
[1] Univ Carlos III Madrid, Madrid 28911, Spain
Keywords
Semantic annotation; Semantic search; Wikipedia; Click-through data; Ranking algorithm; Collaborative tagging; Information retrieval
DOI
10.1016/j.eswa.2012.10.042
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Traditional search engines have become the most useful tools for searching the World Wide Web. Although they work well for certain search tasks, they can be less effective for others, such as queries that are ambiguous or that involve synonyms. In this paper, we propose an algorithm that, with the help of Wikipedia and collaborative semantic annotations, improves how web search engines rank the results they return. Our work draws on (1) the logs generated by query searches, (2) semantic annotations of queries and (3) semantic annotations of web pages. The algorithm uses this information to produce an appropriate ranking. To validate our approach, we have implemented a system that can apply the algorithm to a particular search engine. Evaluation results show that the number of relevant web resources obtained when a query is executed with the algorithm is higher than the number obtained without it. (C) 2012 Elsevier Ltd. All rights reserved.
Pages: 2287-2296
Page count: 10
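The abstract names three inputs (search logs, semantic annotations of queries, semantic annotations of web pages) but does not spell out the ranking algorithm itself. The sketch below is therefore not the authors' method. It is only a minimal illustration, under stated assumptions, of how such inputs could be combined to reorder a result list. The function name `rerank`, the scoring formula, the `weight` parameter, and the example URLs and concept labels are all hypothetical.

```python
from collections import Counter


def rerank(results, query_concept, page_annotations, click_log, weight=1.0):
    """Reorder `results`, a list of URLs in the engine's original order.

    Assumed inputs (hypothetical, not taken from the paper):
      query_concept    -- Wikipedia-style concept label chosen for the query
      page_annotations -- dict mapping URL -> set of concept labels
      click_log        -- iterable of (concept, url) pairs from past searches
    """
    # Count past clicks on each URL for queries annotated with the same concept.
    clicks = Counter(url for concept, url in click_log if concept == query_concept)

    def score(item):
        position, url = item
        base = 1.0 / (position + 1)  # keep some trust in the original rank
        match = 1.0 if query_concept in page_annotations.get(url, set()) else 0.0
        return base + weight * (match + clicks[url])

    ranked = sorted(enumerate(results), key=score, reverse=True)
    return [url for _, url in ranked]


if __name__ == "__main__":
    # Ambiguous query "jaguar", annotated here with the animal concept.
    results = ["cars.example/jaguar", "zoo.example/jaguar", "news.example/jaguar"]
    annotations = {
        "cars.example/jaguar": {"Jaguar Cars"},
        "zoo.example/jaguar": {"Jaguar"},
    }
    log = [
        ("Jaguar", "zoo.example/jaguar"),
        ("Jaguar", "news.example/jaguar"),
        ("Jaguar Cars", "cars.example/jaguar"),
    ]
    print(rerank(results, "Jaguar", annotations, log))
```

In this toy run the page annotated with the same concept as the query moves to the top, which is the kind of disambiguation the abstract describes for ambiguous queries. How the paper actually weighs logs against annotations is not stated in this record.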