Taking RNA-RNA Interaction to Machine Peak

被引:0
作者
Mondal, Chiranjeb [1 ]
Rajopadhye, Sanjay [1 ]
机构
[1] Colorado State Univ, Ft Collins, CO 80523 USA
关键词
BPMax; polyhedral compilation; RRI; PARTITION-FUNCTION ALGORITHM; AFFINE SCHEDULING PROBLEM; EFFICIENT SOLUTIONS; PREDICTION;
D O I
10.1109/TPDS.2024.3380443
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
RNA-RNA interactions (RRIs) are essential in many biological processes, including gene transcription, translation, and localization. They play a critical role in diseases such as cancer and Alzheimer's. Algorithms to model RRI typically use dynamic programming and have the complexity Theta((NM3)-M-3)in time and Theta((NM2)-M-2)in space where N and M are the lengths of the twoRNAsequences.Thismakesitbothessentialandchallengingtopar-allelize them. Previous efforts to do so have been hand-optimized, which is prone to human error and costly to develop and maintain. This paper presents a multi-core CPU parallelization of BP Max, one of the simpler RRI algorithms, generated by a user-guided polyhedral code generation tool, Alpha Z. The user starts with a mathematical specification of the dynamic programming algorithm and provides the choice of polyhedral program transformations such as schedules, memory-maps, and multi-level tiling. Alpha Z automatically generates highly optimized code. At the lowest level, we implemented a small hand-optimized register-tiled "matrix max-plus" kernel and integrated it with our tool-generated optimized code. Our final optimized program version is about 400 x faster than the base program, translating to around 312 GFLOPS, more than half of our platform's Roofline Machine Peak(RMP)performance. On a single core, we attain 80% of RMP. The main kernel in the algorithm, whose complexity is Theta((NM3)-M-3), attains58 GFLOPS on a single-core and 344 GFLOPS on multi-core (90%and 58% of RMP, respectively).
引用
收藏
页码:737 / 749
页数:13
相关论文
共 40 条
[1]   RNA-RNA interaction prediction and antisense RNA target search [J].
Alkan, C ;
Karakoç, E ;
Nadeau, JH ;
Sahinalp, SC ;
Zhang, KH .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :267-282
[2]   Secondary structure prediction of interacting RNA molecules [J].
Andronescu, M ;
Zhang, ZC ;
Condon, A .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 345 (05) :987-1001
[3]  
Bondhugula U., 2007, Tech. Rep. OSU-CISRC-10/07-TR70
[4]   A practical automatic polyhedral parallelizer and locality optimizer [J].
Bondhugula, Uday ;
Hartono, Albert ;
Ramanujam, J. ;
Sadayappan, P. .
ACM SIGPLAN NOTICES, 2008, 43 (06) :101-113
[5]  
Chen Chun, 2008, Technical Report 08-897, USC Computer Science Technical Report
[6]   A partition function algorithm for interacting nucleic acid strands [J].
Chitsaz, Hamidreza ;
Salari, Raheleh ;
Sahinalp, S. Cenk ;
Backofen, Rolf .
BIOINFORMATICS, 2009, 25 (12) :I365-I373
[7]  
de Dinechin F., 1995, Proceedings 1995. Programming Models for Massively Parallel Computers (Cat. No.95TB8112), P18, DOI 10.1109/PMMPC.1995.504337
[8]   Hierarchical static analysis of structured systems of affine recurrence equations [J].
deDinechin, F ;
Robert, S .
INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS 1996, PROCEEDINGS, 1996, :381-390
[9]   A Bayesian statistical algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
COMPUTERS & CHEMISTRY, 1999, 23 (3-4) :387-400
[10]   A partition function algorithm for nucleic acid secondary structure including pseudoknots [J].
Dirks, RM ;
Pierce, NA .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2003, 24 (13) :1664-1677