Community Trolling: An Active Learning Approach for Topic Based Community Detection in Big Data

被引:0
作者
Preeti Gupta
Rajni Jindal
Arun Sharma
机构
[1] Indira Gandhi Delhi Technical University for Women (IGDTUW),Department of IT
[2] Delhi Technological University(DTU),Department of CSE
来源
Journal of Grid Computing | 2018年 / 16卷
关键词
Active learning; Unlabeled big data; Community trolling; Community detection;
D O I
暂无
中图分类号
学科分类号
摘要
Community detection plays an important role in creation and transfer of information. Active learning has been employed recently to improve the performance of community detection techniques. Active learning provides a semi-automatic approach in a selective sampling of data. Based on this, a community trolling approach for topic based community detection in big data is proposed. Community trolling selectively samples the data relevant to the current context from polluted big data using active learning. Fine-tuned data is then used to study community and its sub-communities. Community trolling as a precursor to community detection leads to a reduction of the huge unreliable dataset into a reliable dataset and results in the better prediction of community elements such as important topics and important entities. Finally, the effectiveness of approach was evaluated by implementing it on a real world Tumbler dataset. The results illustrate that community trolling provides a richer dataset resulting in more appropriate communities.
引用
收藏
页码:553 / 567
页数:14
相关论文
共 81 条
[1]  
Abdelbary H(2013)Semantic topics modeling approach for community detection Int. J. Comput. Appl. 81 50-58
[2]  
El-Korany A(2005)A framework for learning predictive structures from multiple tasks and unlabeled data J. Mach. Learn. Res. 6 1817-1853
[3]  
Ando RK(2016)Big data 2.0 processing systems: Taxonomy and open challenges J. Grid Comput. 14 379-405
[4]  
Zhang T(2003)Latent dirichlet allocation J. Mach. Learn. Res. 3 993-1022
[5]  
Bajaber F(2011)Amazon’s mechanical turk a new source of inexpensive, yet high-quality, data? Perspect. Psychol. Sci. 6 3-5
[6]  
Elshawi R(2014)Detecting linguistic markers for radical violence in social media Terrorism and Political Violence 26 246-256
[7]  
Batarfi O(2011)Community detection: topological vs. topical J. Inf. 5 498-514
[8]  
Altalhi A(2010)Community detection in graphs Phys. Rep. 486 75-174
[9]  
Barnawi A(2013)A survey on instance selection for active learning Knowl. Inf. Syst. 35 1-35
[10]  
Sakr S(2016)Scalable machine-learning algorithms for big data analytics: a comprehensive review Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 6 194-214