Heuristic semantic walk for concept chaining in collaborative networks

被引:25
作者
Franzoni, Valentina [1 ]
Milani, Alfredo [1 ,2 ,3 ]
机构
[1] Univ Perugia, Dept Math & Comp Sci, Perugia, Italy
[2] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[3] Univ Perugia, Perugia, Italy
关键词
Data mining; Advanced web applications; Communities on the web; Heuristics search; Semantic similarity measures; Web mining; Web search and information extraction; Semantic networks; Collaborative networks; Random walk;
D O I
10.1108/IJWIS-11-2013-0031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - In this work, a new general framework is proposed to guide navigation over a collaborative concept network, in order to discover paths between concepts. Finding semantic chains between concepts over a semantic network is an issue of great interest for many applications, such as explanation generation and query expansion. Collaborative concept networks over the web tend to have features such as large dimensions, high connectivity degree, dynamically evolution over the time, which represent special challenges for efficient graph search methods, since they result in huge memory requirements, high branching factors, unknown dimensions and high cost for accessing nodes. The paper aims to discuss these issues. Design/methodology/approach - The proposed framework is based on the novel notion of heuristic semantic walk (HSW). In the HSW framework, a semantic proximity measure among concepts, reflecting the collective knowledge embedded in search engines or other statistical sources, is used as a heuristic in order to guide the search in the collaborative network. Different search strategies, information sources and proximity measures, can be used to adapt HSW to the collaborative semantic network under consideration. Findings - Experiments held on the Wikipedia network and Bing search engine on a range of different semantic measures show that the proposed HSW approach with weighted randomized walk strategy outperforms state-of-the-art search methods. Originality/value - To the best of the authors' knowledge, the proposed HSW model is the first approach which uses search engine-based proximity measures as heuristic for semantic search.
引用
收藏
页码:85 / +
页数:20
相关论文
共 30 条
[1]   A Web Search Engine-Based Approach to Measure Semantic Similarity between Words [J].
Bollegala, Danushka ;
Matsuo, Yutaka ;
Ishizuka, Mitsuru .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (07) :977-990
[2]  
Cao G., 2007, P 16 ACM C INF KNOWL
[3]  
CHURCH KW, 1990, 27TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P76
[4]   The Google similarity distance [J].
Cilibrasi, Rudi L. ;
Vitanyi, Paul M. B. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (03) :370-383
[5]  
Etzioni O., 1996, MOVING INFORM FOOD C
[6]  
Franzoni V, 2013, LECT NOTES COMPUT SC, V7974, P643
[7]  
Franzoni V, 2012, 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 2, P442, DOI [10.1109/WI-IAT.2012.226, 10.1109/WMAT.2012.226]
[8]  
Gabrilovich E, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1606
[9]   Exact and approximate graph matching using random walks [J].
Gori, M ;
Maggini, M ;
Sarti, L .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (07) :1100-1111
[10]  
Kurant M., 2010, ON THE BIAS OF BSF