Parallelization of large-scale Drug-Protein binding experiments

被引:1
作者
Makris, Antonios [1 ]
Michail, Dimitrios [1 ]
Varlamis, Iraklis [1 ]
Dimitropoulos, Chronis [1 ]
Tserpes, Konstantinos [1 ]
Tsatsaronis, George [2 ,3 ]
Haupt, Joachim [2 ]
Sawyer, Mark [4 ]
机构
[1] Harokopio Univ, Dept Informat & Telemat, Athens, Greece
[2] Tech Univ Dresden, Dresden, Germany
[3] Elsevier, Amsterdam, Netherlands
[4] Univ Edinburgh, EPCC, Edinburgh, Midlothian, Scotland
来源
2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS) | 2017年
关键词
SUBGRAPH ISOMORPHISM; ALGORITHM; SITES; DISCOVERY; ALIGNMENT;
D O I
10.1109/HPCS.2017.39
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Drug polypharmacology or "drug promiscuity" refers to the ability of a drug to bind multiple proteins. Such studies have huge impact to the pharmaceutical industry, but in the same time require large investments on wet-lab experiments. The respective in-silico experiments have a significantly smaller cost and minimize the expenses for the subsequent lab experiments. However, the process of finding similar protein targets for an existing drug, passes through protein structural similarity and is a highly demanding in computational resources task. In this work, we propose several algorithms that port the protein similarity task to a parallel high-performance computing environment. The differences in size and complexity of the examined protein structures raise several issues in a naive parallelization process that significantly affect the overall time and required memory. We describe several optimizations for better memory and CPU balancing which achieve faster execution times. Experimental results, on a high-performance computing environment with 512 cores and 2048GB of memory, demonstrate the effectiveness of our approach which scales well to large amounts of protein pairs.
引用
收藏
页码:201 / 208
页数:8
相关论文
共 27 条
[1]   BigBWA: approaching the Burrows-Wheeler aligner to Big Data technologies [J].
Abuin, Jose M. ;
Pichel, Juan C. ;
Pena, Tomas F. ;
Amigo, Jorge .
BIOINFORMATICS, 2015, 31 (24) :4003-4005
[2]  
[Anonymous], EUR PAR VIRT MACH
[3]   The Quickhull algorithm for convex hulls [J].
Barber, CB ;
Dobkin, DP ;
Huhdanpaa, H .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04) :469-483
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Resource-constrained project scheduling: Notation, classification, models, and methods [J].
Brucker, P ;
Drexl, A ;
Mohring, R ;
Neumann, K ;
Pesch, E .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1999, 112 (01) :3-41
[6]   A (sub)graph isomorphism algorithm for matching large graphs [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Vento, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1367-1372
[7]  
Dinan J, 2009, P C HIGH PERF COMP N, P1, DOI DOI 10.1145/1654059.1654113
[8]   3-DIMENSIONAL ALPHA-SHAPES [J].
EDELSBRUNNER, H ;
MUCKE, EP .
ACM TRANSACTIONS ON GRAPHICS, 1994, 13 (01) :43-72
[9]   The implementation of the Cilk-5 multithreaded language [J].
Frigo, M ;
Leiserson, CE ;
Randall, KH .
ACM SIGPLAN NOTICES, 1998, 33 (05) :212-223
[10]  
Haupt V. J., 2011, BRIEF BIOINFORM