Community detection in Social Media Performance and application considerations

被引:401
作者
Papadopoulos, Symeon [1 ,2 ]
Kompatsiaris, Yiannis [1 ]
Vakali, Athena [2 ]
Spyridonos, Ploutarchos [2 ]
机构
[1] Informat & Telemat Inst, CERTH, Thessaloniki, Greece
[2] Aristotle Univ Thessaloniki, Dept Informat, GR-54006 Thessaloniki, Greece
关键词
Community detection; Large-scale networks; Social Media; COMPLEX NETWORKS; ALGORITHM; CUTS;
D O I
10.1007/s10618-011-0224-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proposed survey discusses the topic of community detection in the context of Social Media. Community detection constitutes a significant tool for the analysis of complex networks by enabling the study of mesoscopic structures that are often associated with organizational and functional characteristics of the underlying networks. Community detection has proven to be valuable in a series of domains, e.g. biology, social sciences, bibliometrics. However, despite the unprecedented scale, complexity and the dynamic nature of the networks derived from Social Media data, there has only been limited discussion of community detection in this context. More specifically, there is hardly any discussion on the performance characteristics of community detection methods as well as the exploitation of their results in the context of real-world web mining and information retrieval scenarios. To this end, this survey first frames the concept of community and the problem of community detection in the context of Social Media, and provides a compact classification of existing algorithms based on their methodological principles. The survey places special emphasis on the performance of existing methods in terms of computational complexity and memory requirements. It presents both a theoretical and an experimental comparative discussion of several popular methods. In addition, it discusses the possibility for incremental application of the methods and proposes five strategies for scaling community detection to real-world networks of huge scales. Finally, the survey deals with the interpretation and exploitation of community detection results in the context of intelligent web applications and services.
引用
收藏
页码:515 / 554
页数:40
相关论文
共 111 条
[1]  
Andersen R, 2006, ANN IEEE SYMP FOUND, P475
[2]  
[Anonymous], P 2008 INT C CONT BA
[3]  
[Anonymous], 2006, P 12 ACM SIGKDD INT, DOI [10.1145/1150402.1150467, DOI 10.1145/1150402.1150467]
[4]  
[Anonymous], 2010, Proceedings of the 19th International Conference on World Wide Web, DOI DOI 10.1145/1772690.1772762
[5]   Finding and evaluating community structure in networks [J].
Newman, MEJ ;
Girvan, M .
PHYSICAL REVIEW E, 2004, 69 (02) :026113-1
[6]  
[Anonymous], BOOK COMMUNITY BUILT
[7]  
[Anonymous], ARXIV08044356
[8]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI [DOI 10.1145/1341531.1341557, 10.1145/1341531.1341557]
[9]  
[Anonymous], 1971, Journal of Mathematical Sociology, DOI 10.1080/0022250X.1971.9989788
[10]  
[Anonymous], ACM S PRINC DAT SYST