Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

被引:8
作者
Varma, P. China Venkanna
Chakravarthy, K. Venkata Kalyan
Kumari, V. Valli
Raju, S. Viswanadha
机构
关键词
Containers; Context switching; Docker; Hadoop; Map reduce;
D O I
10.1016/j.bdr.2015.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We live in a world increasingly driven by data with more information about individuals, companies and governments available than ever before. Now, every business is powered by Information Technology and generating Big data. Future Business Intelligence can be extracted from the big data. NoSQL [1] and Map-Reduce [2] technologies find an efficient way to store, organize and process the big data using Virtualization and Linux Container (a.k.a. Container) [3] technologies. Provisioning containers on top of virtual machines is a better model for high resource utilization. As the more containers share the same CPU, the context switch latency for each container increases significantly. Such increase leads to a negative impact on the network IO throughput and creates a bottleneck in the big data environments. As part of this paper, we studied container networking and various factors of context switch latency. We evaluate Hadoop benchmarks [5] against the number of containers and virtual machines. We observed a bottleneck where Hadoop [4] cluster throughput is not linear with the number of nodes sharing the same CPU. This bottleneck is due to virtual network layers which adds a significant delay to Round Trip Time (RTT) of data packets. Future work of this paper can be extended to analyze the practical implications of virtual network layers and a solution to improve the performance of big data environments based on containers. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:24 / 28
页数:5
相关论文
共 50 条
  • [31] Fuzzy rule based classification systems for big data with MapReduce: granularity analysis
    Alberto Fernández
    Sara del Río
    Abdullah Bawakid
    Francisco Herrera
    Advances in Data Analysis and Classification, 2017, 11 : 711 - 730
  • [32] Research based on big data analysis of medical industry
    Chen, Lijun
    Lin, Jiaying
    Yi, Zhang
    2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRONIC MATERIALS, COMPUTERS AND MATERIALS ENGINEERING (AEMCME 2019), 2019, 563
  • [33] Research of network data mining based on reliability source under big data environment
    Jinhai Li
    Youshi He
    Yunlei Ma
    Neural Computing and Applications, 2017, 28 : 327 - 335
  • [34] SMART: An Application Framework for Real Time Big Data Analysis on Heterogeneous Cloud Environments
    dos Anjos, Julio C. S.
    Assuncao, Marcos D.
    Bez, Jean
    Geyer, Claudio
    de Freitas, Edison Pignaton
    Carissimi, Alexandre
    Costa, Joao Paulo C. L.
    Fedak, Gilles
    Freitag, Felix
    Markl, Volker
    Fergus, Paul
    Pereira, Rubem
    CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 199 - 206
  • [35] Research of network data mining based on reliability source under big data environment
    Li, Jinhai
    He, Youshi
    Ma, Yunlei
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S327 - S335
  • [36] The Next-Generation NIDS Platform: Cloud-Based Snort NIDS Using Containers and Big Data
    Saputra, Ferry Astika
    Salman, Muhammad
    Hasim, Jauari Akhmad Nur
    Nadhori, Isbat Uzzin
    Ramli, Kalamullah
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (01)
  • [37] A MapReduce-based approach to social network big data mining
    Qi, Fuli
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2023, 23 (05) : 2535 - 2547
  • [38] Migration-Based Online CPSCN Big Data Analysis in Data Centers
    Li, Xin
    Wang, Liangyuan
    Lian, Zhen
    Qin, Xiaolin
    IEEE ACCESS, 2018, 6 : 19270 - 19277
  • [39] Improved Statistical Analysis Method Based on Big Data Technology
    Xu, Hongsheng
    Li, Ke
    Fan, Ganglong
    2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, : 175 - 179
  • [40] Experimental Analysis on Big Data in IOT-Based Architecture
    Bera, Anupam
    Kundu, Anirban
    De Sarkar, Nivedita Ray
    Mou, De
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 1 - 9