Document Similarity Detection using Semantic Social Network Analysis on RDF Citation Graph

Cited by: 0
Authors
Mahmood, Qamar [1]
Qadir, Muhammad Abdul [1]
Afzal, Muhammad Tanvir [1]
Affiliations
[1] Mohammad Ali Jinnah Univ, Ctr Distributed & Semant Comp, Islamabad, Pakistan
Source
2013 IEEE 9TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET 2013) | 2013
Keywords
document similarity; RDF; citation graph; semantic social network analysis;
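DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract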
Document similarity identification is one of the most significant problems in knowledge discovery and information retrieval. One way to measure similarity is to analyze a citation graph of research papers. If document citation information is available as an RDF graph, how can document similarity be measured using social network analysis techniques? We answer this question by applying semantic social network analysis techniques to RDF citation graphs of research papers to identify the pairwise similarity between these papers. For the social network analysis we use the degree centrality and closeness centrality classes from the SemSNA ontology. The minimum cut/maximum flow concept from graph theory is used to quantify the similarity measure. Our experiment uses the CiteSeer data set; for a subset of this data set, our results are promising when compared with manual similarity judgments made by humans. Our results are also encouragingly comparable to other citation link analysis techniques as well as content-based similarity measures, which is why we have focused on an RDF citation-based similarity measure. In future work we plan to use a citation ontology (such as CiTO) to improve RDF graph construction for the proposed similarity measure technique.
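
The abstract names three ingredients (degree centrality, closeness centrality, and minimum cut/maximum flow) but does not say how the paper combines them. The following is only a minimal sketch of those three quantities on a toy citation graph, not the authors' method. Everything in it is an assumption made for illustration: the triples, the "cites" predicate, the function names, the choice to treat citations as undirected edges with unit capacity, and the use of the Edmonds-Karp algorithm for maximum flow.

```python
from collections import defaultdict, deque

# Hypothetical citation triples (citing paper, predicate, cited paper).
# Real data would come from an RDF citation graph; these are made up.
TRIPLES = [
    ("A", "cites", "C"),
    ("A", "cites", "D"),
    ("B", "cites", "C"),
    ("B", "cites", "D"),
    ("B", "cites", "E"),
    ("E", "cites", "A"),
]


def build_undirected(triples):
    """Adjacency sets, ignoring citation direction (an assumption)."""
    adj = defaultdict(set)
    for subject, _predicate, obj in triples:
        adj[subject].add(obj)
        adj[obj].add(subject)
    return dict(adj)


def degree_centrality(adj):
    """Fraction of the other nodes each node is directly linked to."""
    n = len(adj)
    return {v: len(neighbours) / (n - 1) for v, neighbours in adj.items()}


def closeness_centrality(adj, source):
    """Reachable-node count divided by the sum of BFS distances."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    total = sum(dist.values())
    return (len(dist) - 1) / total if total else 0.0


def max_flow(adj, s, t):
    """Edmonds-Karp with unit capacity in both directions of every edge.

    By the max-flow/min-cut theorem the result is also the minimum
    number of edges whose removal separates s from t.
    """
    cap = {(u, v): 1 for u in adj for v in adj[u]}
    flow = 0
    while True:
        # Breadth-first search for an augmenting path in the residual graph.
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v in adj[u]:
                if v not in parent and cap[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return flow
        # Push one unit of flow back along the path that was found.
        v = t
        while parent[v] is not None:
            u = parent[v]
            cap[(u, v)] -= 1
            cap[(v, u)] += 1
            v = u
        flow += 1


if __name__ == "__main__":
    graph = build_undirected(TRIPLES)
    print("degree centrality:", degree_centrality(graph))
    print("closeness of A:", closeness_centrality(graph, "A"))
    print("max flow A-B:", max_flow(graph, "A", "B"))
```

In this toy graph, a higher maximum flow between two papers means they are linked through more edge-disjoint citation paths, which is one plausible reading of flow as a pairwise similarity signal. How the paper weights edges or incorporates the centrality scores is not stated in the abstract.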
Pages: 108-113
Number of pages: 6