Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

Cited by: 8
Authors
Varma, P. China Venkanna
Chakravarthy, K. Venkata Kalyan
Kumari, V. Valli
Raju, S. Viswanadha
Keywords
Containers; Context switching; Docker; Hadoop; MapReduce
DOI
10.1016/j.bdr.2015.12.002
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We live in a world increasingly driven by data, with more information about individuals, companies, and governments available than ever before. Every business is now powered by information technology and generates big data, from which future business intelligence can be extracted. NoSQL [1] and MapReduce [2] technologies provide an efficient way to store, organize, and process big data on top of virtualization and Linux Container (a.k.a. container) [3] technologies. Provisioning containers on top of virtual machines is a better model for high resource utilization. However, as more containers share the same CPU, the context switch latency for each container increases significantly. This increase has a negative impact on network IO throughput and creates a bottleneck in big data environments. In this paper, we study container networking and the factors that contribute to context switch latency. We evaluate Hadoop benchmarks [5] against the number of containers and virtual machines. We observe a bottleneck: Hadoop [4] cluster throughput does not scale linearly with the number of nodes sharing the same CPU. This bottleneck is due to the virtual network layers, which add a significant delay to the round-trip time (RTT) of data packets. Future work can analyze the practical implications of virtual network layers and seek a solution to improve the performance of container-based big data environments. (C) 2016 Elsevier Inc. All rights reserved.
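The chain of reasoning in the abstract (more containers per CPU, higher per-packet delay, longer RTT, lower throughput) can be illustrated with a toy model. The sketch below is not the paper's method or data: it assumes a single flow limited by a fixed in-flight window, so its throughput is bounded by window / RTT, and it assumes a purely hypothetical linear delay added per extra container. The names `rtt_with_sharing`, `WINDOW`, `BASE_RTT`, and `DELAY` and all numeric values are made up for illustration.

```python
def window_limited_throughput(window_bytes: int, rtt_seconds: float) -> float:
    """Upper bound (bytes/s) for one flow that keeps a fixed window in flight."""
    if rtt_seconds <= 0:
        raise ValueError("rtt_seconds must be positive")
    return window_bytes / rtt_seconds


def rtt_with_sharing(base_rtt: float, containers: int, delay_per_container: float) -> float:
    """Hypothetical model: each extra container on the CPU adds a fixed delay to the RTT."""
    if containers < 1:
        raise ValueError("containers must be at least 1")
    return base_rtt + (containers - 1) * delay_per_container


# Assumed, illustrative values only (not measurements from the paper).
WINDOW = 64 * 1024   # bytes in flight per flow
BASE_RTT = 0.0005    # seconds, one container on the CPU
DELAY = 0.0002       # seconds of added delay per additional container

for n in (1, 2, 4, 8):
    rtt = rtt_with_sharing(BASE_RTT, n, DELAY)
    bound = window_limited_throughput(WINDOW, rtt)
    print(f"{n} containers: RTT {rtt * 1e3:.2f} ms, per-flow bound {bound / 1e6:.1f} MB/s")
```

Under these assumptions the per-flow bound falls as the container count rises, which is the qualitative shape of the bottleneck the abstract describes; the actual magnitude depends on measurements the abstract does not give.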
Pages: 24-28
Page count: 5