A Framework for Scheduling and Managing Big Data Applications in a Distributed Infrastructure

被引:0
作者
Govindarajan, Kannan [1 ]
Somasundaram, Thamarai Selvi [2 ]
Boulanger, David [1 ]
Kumar, Vivekanandan Suresh [1 ]
Kinshuk [1 ]
机构
[1] Athabasca Univ, Edmonton, AB, Canada
[2] Anna Univ, Madras, Tamil Nadu, India
来源
2015 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC) | 2015年
关键词
big data; grid computing; cloud computing; cluster computing; software defined networking; distributed processing; Hadoop Distributed File System;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, big data has received attention from researchers, business industries, education, and scientific communities. Big data analytics has to deal with large scale data that consist of both structured and unstructured data. These data are to be handled properly, that is extracting, processing, and analyzing those data to obtain meaningful information from them in a limited time. To yield insightful information, the processing of big data analytics requires high performance computing system, storage, and network resources. Hence, it is essential to design a high performance computing infrastructure with sufficient bandwidth which is capable to handle the big data processing in an efficient manner. However, the current network architectures in those infrastructures, with predefined network policies, do not allow for just-in-time reconfiguration of the networking infrastructure as demanded by big data analytics. In addressing these limitations, Software-Defined Networking (SDN) offers the means to dynamically configure the network parameters, dynamically provision the networks, and the network itself can be sliced in an on-demand manner. This research aims to characterize SDN with respect to the demands of big data analytics in Cluster, Grid, and Cloud Computing resources. The main motivation behind this research study is to design and develop an intelligent framework named as Big Data Analytics Management System (BDAMS) for collectively managing the compute, storage, and network resources in Cluster, Grid, and Cloud infrastructure for big data analytics.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] An Efficient Data Scheduling Scheme for Cloud-based Big Data Framework for Smart City
    Nasser, Nidal
    Khan, Nargis
    ElAttar, Mohamed
    Saleh, Kassem
    Abujamous, Amjad
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [22] BigDL: A Distributed Deep Learning Framework for Big Data
    Dai, Jason
    Wang, Yiheng
    Qiu, Xin
    Ding, Ding
    Zhang, Yao
    Wang, Yanzhang
    Jia, Xianyan
    Zhang, Cherry
    Wan, Yan
    Li, Zhichao
    Wang, Jiao
    Huang, Shengsheng
    Wu, Zhongyuan
    Wang, Yang
    Yang, Yuhao
    She, Bowen
    Shi, Dongjie
    Lu, Qi
    Huang, Kai
    Song, Guoqiong
    PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, : 50 - 60
  • [23] Efficient jobs scheduling approach for big data applications
    Shao, Yanling
    Li, Chunlin
    Gu, Jinguang
    Zhang, Jing
    Luo, Youlong
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 117 : 249 - 261
  • [24] Computing infrastructure for big data processing
    Liu, Ling
    FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (02) : 165 - 170
  • [25] Computing infrastructure for big data processing
    Ling Liu
    Frontiers of Computer Science, 2013, 7 : 165 - 170
  • [26] ClimateSpark: An in-memory distributed computing framework for big climate data analytics
    Hu, Fei
    Yang, Chaowei
    Schnase, John L.
    Duffy, Daniel Q.
    Xu, Mengchao
    Bowen, Michael K.
    Lee, Tsengdar
    Song, Weiwei
    COMPUTERS & GEOSCIENCES, 2018, 115 : 154 - 166
  • [27] FSBD: A Framework for Scheduling of Big Data Mining in Cloud Computing
    Ismail, Leila
    Masud, Mohammad M.
    Khan, Latifur
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 513 - 520
  • [28] PRIMEBALL: A Parallel Processing Framework Benchmark for Big Data Applications in the Cloud
    Ferrarons, Jaume
    Adhana, Mulu
    Colmenares, Carlos
    Pietrowska, Sandra
    Bentayeb, Fadila
    Darmont, Jerome
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING, 2014, 8391 : 109 - 124
  • [29] IntegrityMR: Integrity Assurance Framework for Big Data Analytics and Management Applications
    Wang, Yongzhi
    Wei, Jinpeng
    Srivatsa, Mudhakar
    Duan, Yucong
    Du, Wencai
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [30] PaPar: A Parallel Data Partitioning Framework for Big Data Applications
    Wang, Hao
    Zhang, Jing
    Zhang, Da
    Pumma, Sarunya
    Feng, Wu-chun
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 605 - 614