Finding Community Structure via Rough K-Means in Social Network

被引:0
作者
Zhang, Yunlei [1 ]
Wu, Bin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China
来源
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA | 2015年
关键词
Social Network; Community Structure; Rough K-Means; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Much of the data of scientific interest, particularly when independence of data is not assumed, can be represented in the form of networks where data nodes are joined together to form edges corresponding to some kind of associations or relationships. Such information networks abound, like protein interactions in biology, web page hyperlink connections in information retrieval on the Web, cellphone call graphs in telecommunication, co-authorships in bibliometrics, crime event connections in criminology, etc. All these networks, also known as social networks, share a common property, the formation of connected groups of information nodes, called community structures. These groups are densely connected nodes with sparse connections outside the group. Finding these communities is an important task for the discovery of underlying structures in social networks, and has attracted much attention in data mining research. In this paper, we present rough k-means method (RKM), a new community mining approach that, simply put, regards a community as a set of nodes, these communities have their lower and upper approximation sets. Our algorithm starts by selecting k nodes as the center nodes of communities in a given network then iteratively assembles node to their closest center node to form communities, and subsequently calculates new center node in each group around which to gather nodes again until convergence. Our intuitions are based on proven observations in social networks and the results are promising. Experimental results on benchmark networks verify the feasibility and effectiveness of our new community mining approach.
引用
收藏
页码:2356 / 2361
页数:6
相关论文
共 34 条
[1]  
[Anonymous], ARXIV08011647
[2]  
[Anonymous], 2003, Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining
[3]   A Grid-Based Clustering Method For Mining Frequent Trips From Large-Scale, Event-Based Telematics Datasets [J].
Cao, Qing ;
Bouqata, Bouchra ;
Mackenzie, Patricia D. ;
Messier, Daniel ;
Salvo, Josheph J. .
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, :2996-3001
[4]  
CHAN PK, 1993, ACM IEEE D, P749
[5]  
Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
[6]   A min-max cut algorithm for graph partitioning and data clustering [J].
Ding, CHQ ;
He, XF ;
Zha, HY ;
Gu, M ;
Simon, HD .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :107-114
[7]  
Ester M., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P226
[8]  
Flake G. W., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P150, DOI 10.1145/347090.347121
[9]   Self-organization and identification of web communities [J].
Flake, GW ;
Lawrence, S ;
Giles, CL ;
Coetzee, FM .
COMPUTER, 2002, 35 (03) :66-+
[10]   Community detection in graphs [J].
Fortunato, Santo .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2010, 486 (3-5) :75-174