Semantic similarity-based PageRank using wordnet

被引:1
作者
Poomagal, S. [1 ]
Hamsapriya, T. [2 ]
Visalakshi, P. [3 ]
机构
[1] PSG Coll Technol, Dept Comp & Informat Sci, Coimbatore 641004, Tamil Nadu, India
[2] Oriental Inst Sci & Technol, Bhopal 462021, India
[3] PSG Coll Technol, Dept Elect & Commun Engn, Coimbatore 641004, Tamil Nadu, India
关键词
link analysis; semantic similarity; Wordnet; PageRank; PR;
D O I
10.1504/IJCAT.2013.052292
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the huge volume of web pages that exist today, search engines play an important role in finding the required information. It orders search results by performing link analysis. However, existing link analysis techniques have not considered the semantic similarity among the linked documents for rank calculation. Since links from semantically similar documents are more important than the links from other dissimilar documents, this work introduces a new method for ranking web pages based on the semantic similarity among the web pages and the link structure. Wu and Palmer (1994) measure of wordnet is used to find the semantic relationship between the terms in different documents. Cosine similarity measure is used to find the similarity among the documents. Proposed technique is compared with existing ranking algorithms using the measures precision, recall and F-measure. From the results, it is observed that the proposed method brings more relevant documents to the beginning of the list of search results than the existing methods.
引用
收藏
页码:101 / 112
页数:12
相关论文
共 22 条
[11]  
Kleinberg J.M., 1998, 9 ANN ACM SIAM S DIS, P668
[12]  
Kosala R., 2000, SIGKDD EXPLORATIONS, V2, P1, DOI DOI 10.1145/360402.360406
[13]  
Langville A. N., 2006, GOOGLES PAGERANK SCI
[14]   SALSA: The stochastic approach for link-structure analysis [J].
Lempel, R ;
Moran, S .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (02) :131-160
[15]   The stochastic approach for link-structure analysis (SALSA) and the TKC effect [J].
Lempel, R ;
Moran, S .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :387-401
[16]  
Madria S. K., 1999, Data Warehousing and Knowledge Discovery. First International Conference, DaWaK'99. Proceedings (Lecture Notes in Computer Science Vol.1676), P303
[17]   Web mining in soft computing framework: Relevance, state of the art and future directions [J].
Pal, SK ;
Talwar, V ;
Mitra, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (05) :1163-1177
[18]   A Link-Based Ranking Algorithm for Semantic Web Resources: A Class-Oriented Approach Independent of Link Direction [J].
Park, Hyunjung ;
Rho, Sangkyu ;
Park, Jinsoo .
JOURNAL OF DATABASE MANAGEMENT, 2011, 22 (01) :1-25
[19]   ACO-based BW algorithm for parameter estimation of hidden Markov models [J].
Wang, Qingmiao ;
Ju, Shiguang .
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2011, 41 (3-4) :281-286
[20]  
Yen C-C., 2010, J CONVERGENCE INFORM, V5, P165