Inverse folding with RNA-As-Graphs produces a large pool of candidate sequences with target topologies

被引:16
作者
Jain, Swati [1 ]
Tao, Yunwen [1 ,4 ]
Schlick, Tamar [1 ,2 ,3 ]
机构
[1] NYU, Dept Chem, 1021 Silver,100 Washington Sq East, New York, NY 10003 USA
[2] New York Univ, Courant Inst Math Sci, 251 Mercer St, New York, NY 10012 USA
[3] New York Univ Shanghai, NYU ECNU Ctr Computat Chem, Room 340,Geog Bldg,North Zhongshan Rd, Shanghai 3663, Peoples R China
[4] Southern Methodist Univ, Dept Chem, 3215 Daniel Ave, Dallas, TX 75275 USA
基金
美国国家卫生研究院;
关键词
RNA As Graphs; RNA sequence design; RNA-like topology; Genetic algorithm; Automatic mutations; SECONDARY STRUCTURE; DESIGN; CHALLENGES; ALGORITHM; MOTIFS;
D O I
10.1016/j.jsb.2019.107438
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present an RNA-As-Graphs (RAG) based inverse folding algorithm, RAG-IF, to design novel RNA sequences that fold onto target tree graph topologies. The algorithm can be used to enhance our recently reported computational design pipeline (Jain et al., NAR 2018). The RAG approach represents RNA secondary structures as tree and dual graphs, where RNA loops and helices are coarse-grained as vertices and edges, opening the usage of graph theory methods to study, predict, and design RNA structures. Our recently developed computational pipeline for design utilizes graph partitioning (RAG-3D) and atomic fragment assembly (F-RAG) to design sequences to fold onto RNA-like tree graph topologies; the atomic fragments are taken from existing RNA structures that correspond to tree subgraphs. Because F-RAG may not produce the target folds for all designs, automated mutations by RAG-IF algorithm enhance the candidate pool markedly. The crucial residues for mutation are identified by differences between the predicted and the target topology. A genetic algorithm then mutates the selected residues, and the successful sequences are optimized to retain only the minimal or essential mutations. Here we evaluate RAG-IF for 6 RNA-like topologies and generate a large pool of successful candidate sequences with a variety of minimal mutations. We find that RAG-IF adds robustness and efficiency to our RNA design pipeline, making inverse folding motivated by graph topology rather than secondary structure more productive.
引用
收藏
页数:16
相关论文
共 43 条
[1]   A new algorithm for RNA secondary structure design [J].
Andronescu, M ;
Fejes, AP ;
Hutter, F ;
Hoos, HH ;
Condon, A .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 336 (03) :607-624
[2]  
[Anonymous], 2012, BIOPHYSICS RNA FOLDI
[3]   RNAiFOLD: A CONSTRAINT PROGRAMMING ALGORITHM FOR RNA INVERSE FOLDING AND MOLECULAR DESIGN [J].
Antonio Garcia-Martin, Juan ;
Clote, Peter ;
Dotu, Ivan .
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (02)
[4]   Predicting Large RNA-Like Topologies by a Knowledge-Based Clustering Approach [J].
Baba, Naoto ;
Elmetwaly, Shereef ;
Kim, Namhee ;
Schlick, Tamar .
JOURNAL OF MOLECULAR BIOLOGY, 2016, 428 (05) :811-821
[5]   Using sequence signatures and kink-turn motifs in knowledge-based statistical potentials for RNA structure prediction [J].
Bayrak, Cigdem Sevim ;
Kim, Namhee ;
Schlick, Tamar .
NUCLEIC ACIDS RESEARCH, 2017, 45 (09) :5414-5422
[6]   INFO-RNA - a fast approach to inverse RNA folding [J].
Busch, Anke ;
Backofen, Rolf .
BIOINFORMATICS, 2006, 22 (15) :1823-1831
[7]   Design of RNAs: comparing programs for inverse RNA folding [J].
Churkin, Alexander ;
Retwitzer, Matan Drory ;
Reinharz, Vladimir ;
Ponty, Yann ;
Waldispuhl, Jerome ;
Barash, Danny .
BRIEFINGS IN BIOINFORMATICS, 2018, 19 (02) :350-358
[8]   CENTRAL DOGMA OF MOLECULAR BIOLOGY [J].
CRICK, F .
NATURE, 1970, 227 (5258) :561-&
[9]   Review of CRISPR/Cas9 sgRNA Design Tools [J].
Cui, Yingbo ;
Xu, Jiaming ;
Cheng, Minxia ;
Liao, Xiangke ;
Peng, Shaoliang .
INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2018, 10 (02) :455-465
[10]   Coarse-grained modeling of RNA 3D structure [J].
Dawson, Wayne K. ;
Maciejczyk, Maciej ;
Jankowska, Elzbieta J. ;
Bujnicki, Janusz M. .
METHODS, 2016, 103 :138-156