Parallel social network mining for interesting 'following' patterns

被引:55
作者
Leung, Carson Kai-Sang [1 ]
Jiang, Fan [1 ]
Pazdor, Adam G. M. [1 ]
Peddle, Aaron M. [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
关键词
knowledge discovery; social networks; parallel data mining; concurrent computation; FRIENDS;
D O I
10.1002/cpe.3773
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Social networking sites (e.g., Facebook, Google+, and Twitter) have become popular for sharing valuable knowledge and information among social entities (e.g., individual users and organizations), who are often linked by some interdependency such as friendship. As social networking sites keep growing, there are situations in which a user wants to find those frequently followed groups of social entities so that he can follow the same groups. In this article, we present (i) a space-efficient bitwise data structure for capturing interdependency among social entities; (ii) a time-efficient data mining algorithm that makes the best use of our proposed data structure for serial discovery of groups of frequently followed social entities; and (iii) another time-efficient data mining algorithm for concurrent computation and discovery of groups of frequently followed social entities in parallel so as to handle high volumes of social network data. Evaluation results show the efficiency and practicality of our data structure and social network data mining algorithms. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:3994 / 4012
页数:19
相关论文
共 28 条
  • [1] Agrawal R., P 20 INT C VERY LARG
  • [2] Brijs T., 1999, Proceedings of the fth ACM SIGKDD inter- national conference on Knowledge discovery and data mining, P254, DOI 10.1145/312129.312241
  • [3] Cuzzocrea A., 2015, P EDBT ICDT WORKSH 2, P237
  • [4] Mining constrained frequent itemsets from distributed uncertain data
    Cuzzocrea, Alfredo
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 117 - 126
  • [5] Dhahri Nabila, 2012, Data Warehousing and Knowledge Discovery. Proceedings of the 14th International Conference, DaWaK 2012, P253, DOI 10.1007/978-3-642-32584-7_21
  • [6] Fan Jiang, 2014, Data Warehousing and Knowledge Discovery. 16th International Conference (DaWaK 2014). Proceedings: LNCS 8646, P308, DOI 10.1007/978-3-319-10160-6_28
  • [7] Fan Jiang, 2013, Data Warehousing and Knowledge Discovery. 15th International Conference (DaWaK 2013). Proceedings: LNCS 8057, P209, DOI 10.1007/978-3-642-40131-2_18
  • [8] Han JW, 2000, SIGMOD RECORD, V29, P1
  • [9] Discovery of Really Popular Friends from Social Networks
    Jiang, Fan
    Leung, Carson Kai-Sang
    Liu, Dacheng
    Peddle, Aaron M.
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 342 - 349
  • [10] Ke Wang, 2002, Advances in Knowledge Discovery and Data Mining. 6th Pacific-Asia Conference, PAKDD 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2336), P334