Portable and Reusable Deep Learning Infrastructure with Containers to Accelerate Cancer Studies

被引:2
作者
Zaki, George F. [1 ]
Wozniak, Justin M. [2 ,3 ]
Ozik, Jonathan [3 ,4 ]
Collier, Nicholson [3 ,4 ]
Brettin, Thomas [5 ]
Stevens, Rick [3 ,5 ]
机构
[1] Frederick Natl Lab Canc Res, Frederick, MD 21701 USA
[2] Argonne Natl Lab, Data Sci & Learning, 9700 S Cass Ave, Argonne, IL 60439 USA
[3] Univ Chicago, Chicago, IL 60637 USA
[4] Argonne Natl Lab, Decis & Infrastruct Sci, 9700 S Cass Ave, Argonne, IL 60439 USA
[5] Argonne Natl Lab, Comp Environm & Life Sci, 9700 S Cass Ave, Argonne, IL 60439 USA
来源
PROCEEDINGS OF 2018 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON EXTREME SCALE PROGRAMMING MODELS AND MIDDLEWARE (ESPM2 2018) | 2018年
关键词
D O I
10.1109/ESPM2.2018.00011
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Advanced programming models, domain specific languages, and scripting toolkits have the potential to greatly accelerate the adoption of high performance computing. These complex software systems, however, are often difficult to install and maintain, especially on exotic high-end systems. We consider deep learning workflows used on petascale systems and redeployment on research clusters using containers. Containers are used to deploy the MPI-based infrastructure, but challenges in efficiency, usability, and complexity must be overcome. In this work, we address these challenges through enhancements to a unified workflow system that manages interaction with the container abstraction, the cluster scheduler, and the programming tools. We also report results from running the application on our system, harnessing 298 TFLOPS (single precision).
引用
收藏
页码:54 / 61
页数:8
相关论文
共 12 条
[1]  
Armstrong T. G., 2014, P SC
[2]  
Bergstra JS., 2011, ADV NEURAL INFORM PR, P2546
[3]  
Bischl B., 2017, mlrMBO: A Modular Framework for Model-Based Optimization of Expensive Black-Box Functions
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Felter W, 2015, INT SYM PERFORM ANAL, P171, DOI 10.1109/ISPASS.2015.7095802
[6]  
Hutter Frank, 2012, Learning and Intelligent Optimization. 6th International Conference, LION 6. Revised Selected Papers, P55, DOI 10.1007/978-3-642-34413-8_5
[7]   Singularity: Scientific containers for mobility of compute [J].
Kurtzer, Gregory M. ;
Sochat, Vanessa ;
Bauer, Michael W. .
PLOS ONE, 2017, 12 (05)
[8]  
Ousterhout John K, 1998, IEEE COMPUTER
[9]  
Ozik Jonathan, 2016, P WINT SIM C
[10]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241