Enhancing Scientific Collaborations using Community Detection and Document Clustering

被引:0
作者
Radulescu, Iulia-Maria [1 ]
Truica, Ciprian-Octavian [1 ]
Apostol, Elena-Simona [1 ]
Dobre, Ciprian [1 ,2 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Comp Sci & Engn Dept, Bucharest, Romania
[2] Natl Inst Res & Dev Informat, Bucharest, Romania
来源
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020) | 2020年
关键词
Community detection; Clustering; Louvain; Spherical K-Means;
D O I
10.1109/iccp51029.2020.9266267
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection is the process of extracting community structured subgraphs from community networks. Most research regarding community detection has focused on the network structure without taking the content associated with the nodes into account. In this paper, we propose a new method for enhancing a co-authorship network's structure using clustering. Specifically, considering the clustering process, we use a sequence with proved performance between the WordNet lemmatizer, Document Embeddings and Spherical K-Means, while choosing the Louvain algorithm for community detection. Thus, we improve the Louvain's community detection algorithm modularity by interconnecting the author nodes for the articles clustered together. To evaluate our method, we collected a dataset containing articles' abstracts and authors. The experimental results show that our method suggests potential new collaborations by adding vertices to the graph after analysing the textual content.
引用
收藏
页码:43 / 50
页数:8
相关论文
共 24 条
[11]  
MacQueen J., 1967, P 5 BERKELEY S MATH, V1
[12]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41
[13]   Finding community structure in networks using the eigenvectors of matrices [J].
Newman, M. E. J. .
PHYSICAL REVIEW E, 2006, 74 (03)
[14]   Modularity and community structure in networks [J].
Newman, M. E. J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (23) :8577-8582
[15]   Finding and evaluating community structure in networks [J].
Newman, MEJ ;
Girvan, M .
PHYSICAL REVIEW E, 2004, 69 (02) :026113-1
[16]   Defining and identifying communities in networks [J].
Radicchi, F ;
Castellano, C ;
Cecconi, F ;
Loreto, V ;
Parisi, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (09) :2658-2663
[17]   Clustering Documents using the Document to Vector Model for Dimensionality Reduction [J].
Radu, Robert-George ;
Radulescu, Iulia-Maria ;
Truica, Ciprian-Octavian ;
Apostol, Elena-Simona ;
Mocanu, Mariana .
PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2020, :57-62
[18]   Vocabulary-based Community Detection and Characterization [J].
Ramponi, Giorgia ;
Brambilla, Marco ;
Ceri, Stefano ;
Daniel, Florian ;
Di Giovanni, Marco .
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, :1043-1050
[19]  
Rao B, 2014, IEEE I C COMP INT CO, P460
[20]  
van Laarhoven T, 2016, J MACH LEARN RES, V17