Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

被引:8
|
作者
Varma, P. China Venkanna
Chakravarthy, K. Venkata Kalyan
Kumari, V. Valli
Raju, S. Viswanadha
机构
关键词
Containers; Context switching; Docker; Hadoop; Map reduce;
D O I
10.1016/j.bdr.2015.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We live in a world increasingly driven by data with more information about individuals, companies and governments available than ever before. Now, every business is powered by Information Technology and generating Big data. Future Business Intelligence can be extracted from the big data. NoSQL [1] and Map-Reduce [2] technologies find an efficient way to store, organize and process the big data using Virtualization and Linux Container (a.k.a. Container) [3] technologies. Provisioning containers on top of virtual machines is a better model for high resource utilization. As the more containers share the same CPU, the context switch latency for each container increases significantly. Such increase leads to a negative impact on the network IO throughput and creates a bottleneck in the big data environments. As part of this paper, we studied container networking and various factors of context switch latency. We evaluate Hadoop benchmarks [5] against the number of containers and virtual machines. We observed a bottleneck where Hadoop [4] cluster throughput is not linear with the number of nodes sharing the same CPU. This bottleneck is due to virtual network layers which adds a significant delay to Round Trip Time (RTT) of data packets. Future work of this paper can be extended to analyze the practical implications of virtual network layers and a solution to improve the performance of big data environments based on containers. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:24 / 28
页数:5
相关论文
共 50 条
  • [1] Forensic Analysis of Cryptojacking in Host-based Docker Containers Using Honeypots
    Franco, Javier
    Acar, Abbas
    Aris, Ahmet
    Uluagac, Selcuk
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4860 - 4865
  • [2] Docker Container-Based Big Data Processing System in Multiple Clouds for Everyone
    Naik, Nitin
    2017 IEEE INTERNATIONAL SYMPOSIUM ON SYSTEMS ENGINEERING (ISSE 2017), 2017, : 276 - 282
  • [3] Performance Analysis of a 5G User Plane Function accelerated with eXpress Data Path in Docker Containers
    Scheich, Christian
    Corici, Marius
    Buhr, Hauke
    Magedanz, Thomas
    2023 IEEE FUTURE NETWORKS WORLD FORUM, FNWF, 2024,
  • [4] SLA based healthcare big data analysis and computing in cloud network
    Sahoo, Prasan Kumar
    Mohapatra, Suvendu Kumar
    Wu, Shih-Lin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 119 : 121 - 135
  • [5] Load balancing and service discovery using Docker Swarm for microservice based big data applications
    Neelam Singh
    Yasir Hamid
    Sapna Juneja
    Gautam Srivastava
    Gaurav Dhiman
    Thippa Reddy Gadekallu
    Mohd Asif Shah
    Journal of Cloud Computing, 12
  • [6] Load balancing and service discovery using Docker Swarm for microservice based big data applications
    Singh, Neelam
    Hamid, Yasir
    Juneja, Sapna
    Srivastava, Gautam
    Dhiman, Gaurav
    Gadekallu, Thippa Reddy
    Shah, Mohd Asif
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [7] Implementation of docker-based smart greenhouse data analysis platform
    Hyun, Wook
    Huh, Mi Young
    Park, Juyoung
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1103 - 1106
  • [8] Design and Implementation of Network User Behaviors Analysis Based on Hadoop for Big Data
    Guan, Jianfeng
    Yao, Su
    Xu, Changqiao
    Zhang, Hongke
    APPLICATIONS AND TECHNIQUES IN INFORMATION SECURITY, ATIS 2014, 2014, 490 : 44 - 55
  • [9] Network Precision Marketing Strategy Of Agricultural Products Based On Big Data Analysis
    Hu, Wei
    Wu, Qiu
    2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 366 - 369
  • [10] Big data technologies for image retrieval and analysis in web environments
    Rodriguez-Vaamonde, Sergio
    Torre-Bastida, Ana-Isabel
    Garrote, Estibaliz
    PROFESIONAL DE LA INFORMACION, 2014, 23 (06): : 567 - 574