On the Use of Containers in High Performance Computing Environments

被引:17
作者
Abraham, Subil [1 ]
Paul, Arnab K. [1 ]
Khan, Redwan Ibne Seraj [1 ]
Butt, Ali R. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
来源
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020) | 2020年
基金
美国国家科学基金会;
关键词
Container Performance; High Performance Computing; Parallel File Systems; HPC Storage and I/O;
D O I
10.1109/CLOUD49709.2020.00048
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The lightweight nature, application portability, and deployment flexibility of containers is driving their widespread adoption in cloud solutions. Data analysis and deep learning (DL)/machine learning (ML) applications have especially benefited from containerization. As such data analysis is adopted in high performance computing (HPC), the need for container support in HPC has become paramount. However, containers face crucial performance and I/O challenges in HPC. One obstacle is that while there have been HPC containers, such solutions have not been thoroughly investigated, especially from the aspect of their impact on the crucial HPC I/O throughput. To this end, this paper provides a first-of-its-kind empirical analysis of state-of-the-art representative container solutions (Docker, Podman, Singularity, and Charliecloud) in HPC environments. We also explore how containers interact with an HPC parallel file system like Lustre. We present the design of an analysis framework that is deployed on all nodes in an HPC environment, and captures CPU, memory, network, and file I/O statistics from the nodes and the storage system. We are able to garner key insights from our analysis, e.g., Charliecloud outperforms other container solutions in terms of container start-up time, while Singularity and Charliecloud are equivalent in I/O throughput. But this comes at a cost, as Charliecloud invokes the most metadata and I/O operations on the underlying Lustre file system. By identifying such trade-offs and optimization opportunities, we can enhance HPC containers performance and the ML/DL applications that increasingly rely on them.
引用
收藏
页码:284 / 293
页数:10
相关论文
共 33 条
[1]  
[Anonymous], 2012 SC COMPANION HI
[2]  
[Anonymous], 2016, UMOCI STANDALONE TOO
[3]  
[Anonymous], 2015, P 6 WORKSH SCI CLOUD
[4]  
Arango C., 2017, ARXIV170910140CS
[5]   Enabling HPC workloads on Cloud Infrastructure using Kubernetes Container Orchestration Mechanisms [J].
Beltre, Angel ;
Saha, Pankaj ;
Govindaraju, Madhusudhan ;
Younge, Andrew J. ;
Grant, Ryan Eric .
PROCEEDINGS OF CANOPIE-HPC 2019:2019 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON CONTAINERS AND NEW ORCHESTRATION PARADIGMS FOR ISOLATED ENVIRONMENTS IN HPC (CANOPIE-HPC), 2019, :11-20
[6]   Ookami: Deployment and Initial Experiences [J].
Burford, Andrew ;
Calder, Alan C. ;
Carlson, David ;
Chapman, Barbara ;
Coskun, Firat ;
Curtis, Tony ;
Feldman, Catherine ;
Harrison, Robert J. ;
Kang, Yan ;
Michalowicz, Benjamin ;
Raut, Eric ;
Siegmann, Eva ;
Wood, Daniel G. ;
DeLeon, Robert L. ;
Jones, Mathew ;
Simakov, Nikolay A. ;
White, Joseph P. ;
Oryspayev, Dossay .
PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2021, PEARC 2021, 2021,
[7]   DLHub: Model and Data Serving for Science [J].
Chard, Ryan ;
Li, Zhuozhao ;
Chard, Kyle ;
Ward, Logan ;
Babuji, Yadu ;
Woodard, Anna ;
Tuecke, Steven ;
Blaiszik, Ben ;
Franklin, Michael J. ;
Foster, Ian .
2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, :283-292
[8]   6 Deep Learning in Drug Discovery [J].
Gawehn, Erik ;
Hiss, Jan A. ;
Schneider, Gisbert .
MOLECULAR INFORMATICS, 2016, 35 (01) :3-14
[9]  
Gerhardt Lisa, 2017, Journal of Physics: Conference Series, V898, DOI 10.1088/1742-6596/898/8/082021
[10]   Biomedical Image Processing with Containers and Deep Learning: An Automated Analysis Pipeline: Data architecture, artificial intelligence, automated processing, containerization, and clusters orchestration ease the transition from data acquisition to insights in medium-to-large datasets [J].
Gonzalez, German ;
Evans, Conor L. .
BIOESSAYS, 2019, 41 (06)