Provision and use of GPU resources for distributed workloads via the Grid

被引:0
作者
Traynor, Daniel [1 ]
Froy, Terry [1 ]
机构
[1] Queen Mary Univ London, Mile End Rd, London E1 4NS, England
来源
24TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2019) | 2020年 / 245卷
关键词
D O I
10.1051/epjconf/202024503002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Queen Mary University of London WLCG Tier-2 Grid site has been providing GPU resources on the Grid since 2016. GPUs are an important modern tool to assist in data analysis. They have historically been used to accelerate computationally expensive but parallelisable workloads using frameworks such as OpenCL and CUDA. However, more recently their power in accelerating machine learning, using libraries such as TensorFlow and Coffee, has come to the fore and the demand for GPU resources has increased. Significant effort is being spent in high energy physics to investigate and use machine learning to enhance the analysis of data. GPUs may also provide part of the solution to the compute challenge of the High Luminosity LHC. The motivation for providing GPU resources via the Grid is presented. The installation and configuration of the SLURM batch system together with Compute Elements (CREAM and ARC) for use with GPUs is shown. Real world use cases are presented and the success and issues discovered are discussed.
引用
收藏
页数:5
相关论文
共 11 条
[1]  
Blyth Simon, PLENARY TALK CHEP201
[2]  
Bonvin Alexandre M. J. J., 2010, Journal of Structural and Functional Genomics, V11, P1, DOI 10.1007/s10969-010-9084-9
[3]   A readout chip for a 64 x 64 pixel matrix with 15-bit single Photon Counting [J].
Campbell, M ;
Heijne, EHM ;
Meddeler, G ;
Pernigotti, E ;
Snoeys, W .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1998, 45 (03) :751-753
[4]   Photon tracking with GPUs in IceCube [J].
Chirkin, Dmitry .
NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2013, 725 :141-143
[5]   Advanced Resource Connector middleware for lightweight computational Grids [J].
Ellert, M. ;
Gronager, M. ;
Konstantinov, A. ;
Konya, B. ;
Lindemann, J. ;
Livenson, I. ;
Nielsen, J. L. ;
Niinimaki, A. ;
Smirnova, O. ;
Waananen, A. .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2007, 23 (02) :219-240
[6]  
Forti Alessandra, 2019, ATLS OFTSLIDE
[7]   Enabling rootless Linux Containers in multi-user environments: The udocker tool [J].
Gomes, Jorge ;
Bagnaschi, Emanuele ;
Campos, Isabel ;
David, Mario ;
Alves, Luis ;
Martins, Joao ;
Pina, Joao ;
Lopez-Garcia, Alvaro ;
Orviz, Pablo .
COMPUTER PHYSICS COMMUNICATIONS, 2018, 232 :84-97
[8]   The LUCID-Timepix spacecraft payload and the CERN@school educational programme [J].
Hatfield, P. ;
Furnell, W. ;
Shenoy, A. ;
Fox, E. ;
Parker, R. ;
Thomas, L. .
JOURNAL OF INSTRUMENTATION, 2018, 13
[9]  
Hays Jonathan, MACHINE LEARNING MON
[10]  
Whyntie T., 2013, Physics Education, V48, P344, DOI 10.1088/0031-9120/48/3/344