Management of Distributed Big Data for Social Networks

被引:8
作者
Leung, Carson K. [1 ]
Zhang, Hao [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
来源
2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID) | 2016年
关键词
Big data; distributed data; distributed computing; big data management; data analytics; graph data; data representation; STREAMS;
D O I
10.1109/CCGrid.2016.107
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the current era of big data, high volumes of a wide variety of valuable data can be easily collected and generated from a broad range of data sources of different veracities at a high velocity. Due to the well-known 5V's of these big data, many traditional data management approaches may not be suitable for handling the big data. Over the past few years, several applications and systems have developed to use cluster, cloud or grid computing to manage big data so as to support data science, big data analytics, as well as knowledge discovery and data mining. In this paper, we focus on distributed big data management. Specifically, we present our method for big data representation and management of distributed big data from social networks. We represent such big graph data in distributed settings so as to support big data mining of frequently occurring patterns from social networks.
引用
收藏
页码:639 / 648
页数:10
相关论文
共 29 条
  • [1] Agrawal D., P EDBT 2016, P479
  • [2] Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility
    Buyya, Rajkumar
    Yeo, Chee Shin
    Venugopal, Srikumar
    Broberg, James
    Brandic, Ivona
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (06): : 599 - 616
  • [3] Chen J., 2009, ENCY DATABASE SYSTEM, P1276
  • [4] Weaving computational grids: How analogous are they with electrical grids?
    Chetty, M
    Buyya, R
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2002, 4 (04) : 61 - 71
  • [5] Cuzzocrea A., P ACM SAC 2015, P919
  • [6] High-Recall Information Retrieval from Linked Big Data
    Cuzzocrea, Alfredo
    Lee, Wookey
    Leung, Carson K.
    [J]. 39TH ANNUAL IEEE COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2015), VOL 2, 2015, : 712 - 717
  • [7] Edge-based mining of frequent subgraphs from graph streams
    Cuzzocrea, Alfredo
    Han, Zhao
    Jiang, Fan
    Leung, Carson K.
    Zhang, Hao
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015, 2015, 60 : 573 - 582
  • [8] Mapreduce: Simplified data processing on large clusters
    Dean, Jeffrey
    Ghemawat, Sanjay
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 107 - 113
  • [9] Equality and Social Mobility in Twitter Discussion Groups
    Ellis, Katherine
    Goldszmidt, Moises
    Lanckriet, Gert
    Mishra, Nina
    Reingold, Omer
    [J]. PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 523 - 532
  • [10] Parallel Clustering of High-Dimensional Social Media Data Streams
    Gao, Xiaoming
    Ferrara, Emilio
    Qiu, Judy
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 323 - 332