GraphDBLP: a system for analysing networks of computer scientists through graph databases

被引:13
作者
Mezzanzanica, Mario [1 ]
Mercorio, Fabio [1 ]
Cesarini, Mirko [1 ]
Moscato, Vincenzo [2 ]
Picariello, Antonio [2 ]
机构
[1] Univ Milano Bicocca, CRISP Res Ctr, Dept Stat & Quantitat Methods, Milan, Italy
[2] Univ Naples Federico II, Dept Elect Engn & Informat Technol DIETI, Naples, Italy
关键词
Graph database; Word embedding; Knowledge extraction; Semantic analytics; Social network analysis; DBLP; RECOMMENDATION;
D O I
10.1007/s11042-017-5503-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents GraphDBLP, a system that models the DBLP bibliography as a graph database for performing graph-based queries and social network analyses. GraphDBLP also enriches the DBLP data through semantic keyword similarities computed via word-embedding. In this paper, we discuss how the system was formalized as a multi-graph, and how similarity relations were identified through word2vec. We also provide three meaningful queries for exploring the DBLP community to (i) investigate author profiles by analysing their publication records; (ii) identify the most prolific authors on a given topic, and (iii) perform social network analyses over the whole community. To date, GraphDBLP contains 5+ million nodes and 24+ million relationships, enabling users to explore the DBLP data by referencing more than 3.3 million publications, 1.7 million authors, and more than 5 thousand publication venues. Through the use of word-embedding, more than 7.5 thousand keywords and related similarity values were collected. GraphDBLP was implemented on top of the Neo4j graph database. The whole dataset and the source code are publicly available to foster the improvement of GraphDBLP in the whole computer science community.
引用
收藏
页码:18657 / 18688
页数:32
相关论文
共 51 条
[1]   Incorporating contextual information in recommender systems using a multidimensional approach [J].
Adomavicius, G ;
Sankaranarayanan, R ;
Sen, S ;
Tuzhilin, A .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2005, 23 (01) :103-145
[2]  
Aggarwal CC, 2011, SOCIAL NETWORK DATA ANALYTICS, P1
[3]   A Multimedia Recommender System [J].
Albanese, Massimiliano ;
d'Acierno, Antonio ;
Moscato, Vincenzo ;
Persia, Fabio ;
Picariello, Antonio .
ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2013, 13 (01)
[4]  
Amato F, 2017, FUT GEN COMPUT SYST
[5]   Survey of graph database models [J].
Angles, Renzo ;
Gutierrez, Claudio .
ACM COMPUTING SURVEYS, 2008, 40 (01)
[6]  
[Anonymous], 2017, Social Network Analysis
[7]  
[Anonymous], 2017, DISTRIBUTED GRAPH DA
[8]   Recommendations in location-based social networks: a survey [J].
Bao, Jie ;
Zheng, Yu ;
Wilkie, David ;
Mokbel, Mohamed .
GEOINFORMATICA, 2015, 19 (03) :525-565
[9]   The architecture of complex weighted networks [J].
Barrat, A ;
Barthélemy, M ;
Pastor-Satorras, R ;
Vespignani, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (11) :3747-3752
[10]  
Belk V., 2012, PROC 6 INT C WEBLOGS, P34