Semantic similarity-based PageRank using wordnet

Cited by: 1
Authors
Poomagal, S. [1 ]
Hamsapriya, T. [2 ]
Visalakshi, P. [3 ]
Affiliations
[1] PSG Coll Technol, Dept Comp & Informat Sci, Coimbatore 641004, Tamil Nadu, India
[2] Oriental Inst Sci & Technol, Bhopal 462021, India
[3] PSG Coll Technol, Dept Elect & Commun Engn, Coimbatore 641004, Tamil Nadu, India
Keywords
link analysis; semantic similarity; Wordnet; PageRank; PR;
DOI
10.1504/IJCAT.2013.052292
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
With the huge volume of web pages that exist today, search engines play an important role in finding the required information. Search engines order their results by performing link analysis. However, existing link analysis techniques do not consider the semantic similarity among linked documents when calculating rank. Since links from semantically similar documents are more important than links from dissimilar documents, this work introduces a new method for ranking web pages based on both the link structure and the semantic similarity among the pages. The Wu and Palmer (1994) measure over WordNet is used to find the semantic relationship between terms in different documents, and the cosine similarity measure is used to find the similarity among documents. The proposed technique is compared with existing ranking algorithms using precision, recall and F-measure. The results show that the proposed method brings more relevant documents to the top of the list of search results than the existing methods.
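The abstract does not give the ranking formula, so the following is only a minimal sketch of the general idea, under stated assumptions: each out-link's share of a page's rank is taken to be proportional to the cosine similarity between the source and target documents, instead of the uniform split used in standard PageRank. The function names, the damping factor of 0.85, the uniform fallback for zero-similarity links, and the toy data are all hypothetical. The WordNet step is omitted: terms are matched exactly here, whereas the paper relates terms through the Wu and Palmer measure.

```python
import math
from collections import Counter


def cosine_similarity(doc_a, doc_b):
    """Cosine similarity between two token lists, using raw term counts."""
    va, vb = Counter(doc_a), Counter(doc_b)
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_weighted_pagerank(links, docs, damping=0.85, iterations=50):
    """PageRank where a page's rank is split among its out-links in
    proportion to source/target cosine similarity (an assumed weighting,
    not necessarily the formula used in the paper)."""
    pages = list(docs)
    n = len(pages)

    # Normalized out-link weights for every page.
    weights = {}
    for src in pages:
        targets = links.get(src, [])
        sims = {dst: cosine_similarity(docs[src], docs[dst]) for dst in targets}
        total = sum(sims.values())
        if sims and total == 0:
            # Assumption: fall back to a uniform split when no target is similar.
            sims = {dst: 1.0 for dst in sims}
            total = float(len(sims))
        weights[src] = {dst: s / total for dst, s in sims.items()}

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread uniformly.
        dangling = sum(rank[p] for p in pages if not weights[p])
        new_rank = {p: (1 - damping) / n + damping * dangling / n for p in pages}
        for src in pages:
            for dst, w in weights[src].items():
                new_rank[dst] += damping * rank[src] * w
        rank = new_rank
    return rank


def precision_recall_f(retrieved, relevant):
    """Precision, recall and F-measure for a retrieved list against a relevant set."""
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Toy data (hypothetical): page "a" links to "b" and "c"; "b" shares more terms with "a".
docs = {
    "a": ["rank", "web", "page"],
    "b": ["rank", "web", "link"],
    "c": ["music", "guitar", "page"],
}
links = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
ranks = similarity_weighted_pagerank(links, docs)
```

In this toy graph, page "b" receives a larger share of the rank of "a" than page "c" does, because it shares more terms with "a". To bring the sketch closer to the abstract, the exact term matching in `cosine_similarity` could be replaced by a WordNet-based term relatedness score.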
Pages: 101-112
Number of pages: 12
Related papers
22 in total
[1]  
[Anonymous], 1999, SIDLWP19990120
[2]  
Bendersky Michael, 2011, P WSDM, P95, DOI 10.1145/1935826.1935849
[3]   The anatomy of a large-scale hypertextual Web search engine [J].
Brin, S ;
Page, L .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :107-117
[4]  
Chabra S., 2011, INT J COMPUTER APPL, P15
[5]   Automatic resource compilation by analyzing hyperlink structure and associated text [J].
Chakrabarti, S ;
Dom, B ;
Raghavan, P ;
Rajagopalan, S ;
Gibson, D ;
Kleinberg, J .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :65-74
[6]  
Chakrabarti Soumen, 1998, ACM SIGMOD RECORD, V27, P307, DOI [10.1145/276305.276332, 10.1145/276304.276332]
[7]  
Christoph B., 2011, P 14 INT C EXT DAT T, P546
[8]  
DONG A., 2010, P 3 ACM INT C WEB SE, P11
[9]   A fast search method of similar strings from dictionaries [J].
Fuketa, Masao ;
Atlam, El-Sayed ;
Fujisawa, Nobuo ;
Hanafusa, Hiroshi ;
Morita, Kazuhiro ;
Aoe, Jun-ichi .
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2011, 40 (04) :265-272
[10]  
Gao LC, 2010, INT J COMPUT APPL T, V38, P306, DOI 10.1504/IJCAT.2010.034531