Enhancing Scientific Collaborations using Community Detection and Document Clustering

被引:0
作者
Radulescu, Iulia-Maria [1 ]
Truica, Ciprian-Octavian [1 ]
Apostol, Elena-Simona [1 ]
Dobre, Ciprian [1 ,2 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Comp Sci & Engn Dept, Bucharest, Romania
[2] Natl Inst Res & Dev Informat, Bucharest, Romania
来源
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020) | 2020年
关键词
Community detection; Clustering; Louvain; Spherical K-Means;
D O I
10.1109/iccp51029.2020.9266267
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection is the process of extracting community structured subgraphs from community networks. Most research regarding community detection has focused on the network structure without taking the content associated with the nodes into account. In this paper, we propose a new method for enhancing a co-authorship network's structure using clustering. Specifically, considering the clustering process, we use a sequence with proved performance between the WordNet lemmatizer, Document Embeddings and Spherical K-Means, while choosing the Louvain algorithm for community detection. Thus, we improve the Louvain's community detection algorithm modularity by interconnecting the author nodes for the articles clustered together. To evaluate our method, we collected a dataset containing articles' abstracts and authors. The experimental results show that our method suggests potential new collaborations by adding vertices to the graph after analysing the textual content.
引用
收藏
页码:43 / 50
页数:8
相关论文
共 24 条
[1]   Community detection in social networks using hybrid merging of sub-communities [J].
Arab, Mohsen ;
Afsharchi, Mohsen .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2014, 40 :73-84
[2]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[3]   Metrics for Community Analysis: A Survey [J].
Chakraborty, Tanmoy ;
Dalmia, Ayushi ;
Mukherjee, Animesh ;
Ganguly, Niloy .
ACM COMPUTING SURVEYS, 2017, 50 (04)
[4]   Community Detection via Maximization of Modularity and Its Variants [J].
Chen, Mingming ;
Kuzmin, Konstantin ;
Szymanski, Boleslaw K. .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2014, 1 (01) :46-65
[5]   Community detection in networks: A user guide [J].
Fortunato, Santo ;
Hric, Darko .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2016, 659 :1-44
[6]  
Hornik K, 2012, J STAT SOFTW, V50, P1
[7]  
Khan B. S., 2017, ARXIV PREPRINT ARXIV
[8]   Multi-co-training for document classification using various document representations: TF-IDF, LDA, and Doc2Vec [J].
Kim, Donghwa ;
Seo, Deokseong ;
Cho, Suhyoun ;
Kang, Pilsung .
INFORMATION SCIENCES, 2019, 477 :15-29
[9]  
Le Q., 2014, INT C MACH LEARN PML, P1188, DOI DOI 10.1145/2740908.2742760
[10]  
MacQueen J., 1967, P 5 BERK S MATH STAT, V5, P281