Distributing Deep Learning Hyperparameter Tuning for 3D Medical Image Segmentation

被引:3
作者
Berral, Josep Ll [1 ]
Aranda, Oriol [1 ]
Luis Dominguez, Juan [1 ]
Torres, Jordi [1 ]
机构
[1] Univ Politecn Cataluna, Barcelona Supercomp Ctr, Barcelona, Spain
来源
2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022) | 2022年
基金
欧盟地平线“2020”;
关键词
Distributed Deep Learning; Distributed Computing; GPU; Parallelism; Scalability; OPTIMIZATION;
D O I
10.1109/IPDPSW55747.2022.00172
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Most research on novel techniques for 3D Medical Image Segmentation (MIS) is currently done using Deep Learning with GPU accelerators. The principal challenge of such technique is that a single input can easily cope computing resources, and require prohibitive amounts of time to be processed. Distribution of deep learning and scalability over computing devices is an actual need for progressing on such research field. Conventional distribution of neural networks consist in "data parallelism", where data is scattered over resources (e.g., GPUs) to parallelize the training of the model. However, "experiment parallelism" is also an option, where different training processes (i.e., on a hyper-parameter search) are parallelized across resources. While the first option is much more common on 3D image segmentation, the second provides a pipeline design with less dependence among parallelized processes, allowing overhead reduction and more potential scalability. In this work we present a design for distributed deep learning training pipelines, focusing on multinode and multi-GPU environments, where the two different distribution approaches are deployed and benchmarked. We take as proof of concept the 3D U-Net architecture, using the MSD Brain Tumor Segmentation dataset, a state-of-art problem in medical image segmentation with high computing and space requirements. Using the BSC MareNostrum supercomputer as benchmarking environment, we use TensorFlow and Ray as neural network training and experiment distribution platforms. We evaluate the experiment speed-up when parallelizing, showing the potential for scaling out on GPUs and nodes. Also comparing the different parallelism techniques, showing how experiment distribution leverages better such resources through scaling, e.g. by a speed-up factor from x12 to x14 using 32 GPUs. Finally, we provide the implementation of the design open to the community, and the non-trivial steps and methodology for adapting and deploying a MIS case as the here presented.
引用
收藏
页码:1045 / 1052
页数:8
相关论文
共 45 条
[1]  
Akintoye S. B., 2021, HYBRID PARALLELIZATI
[2]   Probabilistic segmentation of brain tissue in MR imaging [J].
Anbeek, P ;
Vincken, KL ;
van Bochove, GS ;
van Osch, MJP ;
van der Grond, J .
NEUROIMAGE, 2005, 27 (04) :795-804
[3]  
[Anonymous], 2019, arXiv Preprint arXiv
[4]  
Bergstra J., 2011, ADV NEURAL INFORM PR, V24, P2546
[5]   Automatic Hyperparameter Optimization for Transfer Learning on Medical Image Datasets Using Bayesian Optimization [J].
Borgli, Rune Johan ;
Stensland, Hakon Kvale ;
Riegler, Michael Alexander ;
Halvorsen, Pal .
2019 13TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION AND COMMUNICATION TECHNOLOGY (ISMICT), 2019, :175-180
[6]   Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster [J].
Campos, Victor ;
Sastre, Francesc ;
Yagues, Maurici ;
Bellver, Miriam ;
Giro-i-Nieto, Xavier ;
Torres, Jordi .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 :315-324
[7]   Performance measure characterization for evaluating neuroimage segmentation algorithms [J].
Chang, Herng-Hua ;
Zhuang, Audrey H. ;
Valentino, Daniel J. ;
Chu, Woei-Chyn .
NEUROIMAGE, 2009, 47 (01) :122-135
[8]  
Chang J, 2018, 2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018)
[9]   Big Data Deep Learning: Challenges and Perspectives [J].
Chen, Xue-Wen ;
Lin, Xiaotong .
IEEE ACCESS, 2014, 2 :514-525
[10]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49