Semantically-enhanced information retrieval using multiple knowledge sources

Cited by: 12
Authors
Jiang, Yuncheng [1]
Affiliations
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
Source
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2020, Vol. 23, No. 4
Funding
National Natural Science Foundation of China
Keywords
Information retrieval; Keyword search; Semantic relatedness; Multiple knowledge sources; WORD SENSE DISAMBIGUATION; LINKED DATA; SEARCH; ONTOLOGY; WEB; SIMILARITY; WIKIPEDIA; CONSTRUCTION; POINT;
DOI
10.1007/s10586-020-03057-7
Chinese Library Classification
TP [Automation and Computer Technology]
Discipline classification code
0812
Abstract
Classical Information Retrieval (IR) approaches rely on word-based representations of the query and of the documents in the collection. The user's information need is specified entirely by the words in the original query, and retrieval returns documents containing those words. Such approaches are limited when relevant keywords are absent and when terms vary between documents and the user's query. This paper presents a new method for Semantic Information Retrieval (SIR) that addresses these limitations. Concretely, we propose SIRWWO (Semantic Information Retrieval using Wikipedia, WordNet, and domain Ontologies), a novel SIR method that combines multiple knowledge sources: Wikipedia, WordNet, and Description Logic (DL) ontologies. To illustrate SIRWWO, we first present the notion of a Labeled Dynamic Semantic Network (LDSN), which extends the notions of dynamic semantic network and extended semantic net based on WordNet (and the DAML ontology library). From the LDSN we derive the notion of a Weighted Dynamic Semantic Network (WDSN; intuitively, each edge in a WDSN is assigned a number in the interval [0, 1]) and give a method for constructing the WDSN from Wikipedia, WordNet, and DL ontologies. We then propose a novel metric for measuring the semantic relatedness between concepts based on the WDSN. Lastly, we investigate SIRWWO by using the semantic relatedness between users' query keywords and digital documents. The experimental results show that our proposals achieve performance comparable to or better than that of the traditional IR system Lucene.
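The abstract does not state the relatedness metric or the ranking formula, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a weighted semantic network stored as an undirected graph with edge weights in [0, 1], takes the relatedness of two concepts to be the largest product of edge weights along any connecting path (an illustrative choice), and scores a document by averaging, over the query keywords, the best relatedness to any document concept. The names `make_graph`, `relatedness`, `score_document`, and the toy edges are all hypothetical.

```python
import heapq


def make_graph(edges):
    """Build an undirected weighted graph from (concept_a, concept_b, weight) triples."""
    graph = {}
    for a, b, weight in edges:
        if not 0.0 <= weight <= 1.0:
            raise ValueError("edge weights must lie in [0, 1]")
        graph.setdefault(a, {})[b] = weight
        graph.setdefault(b, {})[a] = weight
    return graph


def relatedness(graph, source, target):
    """Assumed metric: maximum product of edge weights over all paths (0.0 if unconnected)."""
    if source == target:
        return 1.0
    best = {source: 1.0}
    heap = [(-1.0, source)]  # max-heap via negated scores
    while heap:
        neg_score, node = heapq.heappop(heap)
        score = -neg_score
        if node == target:
            return score
        if score < best.get(node, 0.0):
            continue  # stale heap entry
        for neighbor, weight in graph.get(node, {}).items():
            candidate = score * weight
            if candidate > best.get(neighbor, 0.0):
                best[neighbor] = candidate
                heapq.heappush(heap, (-candidate, neighbor))
    return 0.0


def score_document(graph, query_keywords, doc_concepts):
    """Average, over query keywords, of the best relatedness to any document concept."""
    if not query_keywords:
        return 0.0
    total = sum(
        max((relatedness(graph, q, c) for c in doc_concepts), default=0.0)
        for q in query_keywords
    )
    return total / len(query_keywords)


# Toy network with made-up weights, for illustration only.
TOY = make_graph([
    ("car", "vehicle", 0.9),
    ("vehicle", "truck", 0.8),
    ("car", "engine", 0.7),
    ("banana", "fruit", 0.9),
])

if __name__ == "__main__":
    print(relatedness(TOY, "car", "truck"))
    print(score_document(TOY, ["car"], ["truck", "engine"]))
```

Because every weight is at most 1, path scores never increase as a path grows, which is why a Dijkstra-style best-first search is sufficient here. A document that never mentions "car" can still score above zero for that query if it contains a related concept such as "truck", which is the behavior the abstract contrasts with purely word-based matching.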
Pages: 2925-2944
Page count: 20