Understanding the I/O Impact on the Performance of High-Throughput Molecular Docking

被引:10
作者
Markidis, Stefano [1 ,2 ]
Gadioli, Davide [2 ]
Vitali, Emanuele [2 ]
Palermo, Gianluca [2 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] Politecn Milan, Milan, Italy
来源
PROCEEDINGS OF IEEE/ACM SIXTH INTERNATIONAL PARALLEL DATA SYSTEMS WORKSHOP (PDSW 2021) | 2021年
基金
瑞典研究理事会; 欧盟地平线“2020”;
关键词
High-Throughput Molecular Docking; Parallel I/O; Darshan; I/O Profiling; LIGEN;
D O I
10.1109/PDSW54622.2021.00007
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-throughput molecular docking is a data-driven simulation methodology to estimate millions of molecules' position and interaction strength (ligands) when interacting with a given protein site. Because of its data-driven nature, the high-throughput molecular docking performance depends on how fast we can ingest data into the processing pipeline and how efficiently we can write molecular docking results to a shared file. This work characterizes the I/O performance of a high-performance, high-throughput molecular docking application, called Docker-HT, running on a supercomputer up to 512 computing nodes with two different parallel I/O configurations. We show that a tuned I/O configuration can improve the overall parallel efficiency from 71% to 90% on 512 nodes and identify and solve a performance degradation observed when running on 16 and 32 nodes.
引用
收藏
页码:9 / 14
页数:6
相关论文
共 19 条
[1]   Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19 [J].
Acharya, A. ;
Agarwal, R. ;
Baker, M. B. ;
Baudry, J. ;
Bhowmik, D. ;
Boehm, S. ;
Byler, K. G. ;
Chen, S. Y. ;
Coates, L. ;
Cooper, C. J. ;
Demerdash, O. ;
Daidone, I ;
Eblen, J. D. ;
Ellingson, S. ;
Forli, S. ;
Glaser, J. ;
Gumbart, J. C. ;
Gunnels, J. ;
Hernandez, O. ;
Irle, S. ;
Kneller, D. W. ;
Kovalevsky, A. ;
Larkin, J. ;
Lawrence, T. J. ;
LeGrand, S. ;
Liu, S-H ;
Mitchell, J. C. ;
Park, G. ;
Parks, J. M. ;
Pavlova, A. ;
Petridis, L. ;
Poole, D. ;
Pouchard, L. ;
Ramanathan, A. ;
Rogers, D. M. ;
Santos-Martins, D. ;
Scheinberg, A. ;
Sedova, A. ;
Shen, Y. ;
Smith, J. C. ;
Smith, M. D. ;
Soto, C. ;
Tsaris, A. ;
Thavappiragasam, M. ;
Tillack, A. F. ;
Vermaas, J. V. ;
Vuong, V. Q. ;
Yin, J. ;
Yoo, S. ;
Zahran, M. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (12) :5832-5852
[2]  
Ali N, 2009, 2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, P86
[3]   Use of Experimental Design To Optimize Docking Performance: The Case of LiGenDock, the Docking Module of Ligen, a New De Novo Design Program [J].
Beato, Claudia ;
Beccari, Andrea R. ;
Cavazzoni, Carlo ;
Lorenzi, Simone ;
Costantino, Gabriele .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (06) :1503-1517
[4]   LiGen: A High Performance Workflow for Chemistry Driven de Novo Design [J].
Beccari, Andrea R. ;
Cavazzoni, Carlo ;
Beato, Claudia ;
Costantino, Gabriele .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (06) :1518-1527
[5]  
Biesiada Jacek, 2011, Hum Genomics, V5, P497
[6]  
Ellingson S.R., 2013, VinaMPI: Facilitating multiple receptor high-throughput virtual docking on high-performance computers
[7]   Tunable approximations to control time-to-solution in an HPC molecular docking Mini-App [J].
Gadioli, Davide ;
Palermo, Gianluca ;
Cherubin, Stefano ;
Vitali, Emanuele ;
Agosta, Giovanni ;
Manelfi, Candida ;
Beccari, Andrea R. ;
Cavazzoni, Carlo ;
Sanna, Nico ;
Silvano, Cristina .
JOURNAL OF SUPERCOMPUTING, 2021, 77 (01) :841-869
[8]  
Gropp William, 2014, Using Advanced MPI: Modern Features of the Message-Passing Interface
[9]   GPU-Accelerated Drug Discovery with Docking on the Summit Supercomputer: Porting, Optimization, and Application to COVID-19 Research [J].
LeGrand, Scott ;
Scheinberg, Aaron ;
Tillack, Andreas F. ;
Thavappiragasam, Mathialakan ;
Vermaas, Josh V. ;
Agarwal, Rupesh ;
Larkin, Jeff ;
Poole, Duncan ;
Santos-Martins, Diogo ;
Solis-Vasquez, Leonardo ;
Koch, Andreas ;
Forli, Stefano ;
Hernandez, Oscar ;
Smith, Jeremy C. ;
Sedova, Ada .
ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
[10]   Open Babel: An open chemical toolbox [J].
O'Boyle, Noel M. ;
Banck, Michael ;
James, Craig A. ;
Morley, Chris ;
Vandermeersch, Tim ;
Hutchison, Geoffrey R. .
JOURNAL OF CHEMINFORMATICS, 2011, 3