RNAknot: A new algorithm for RNA secondary structure prediction based on genetic algorithm and GRASP method

Cited by: 2
Authors
El Fatmi, Abdelhakim [1]
Bekri, M. Ali [1]
Benhlima, Said [1]
Affiliations
[1] Moulay Ismail Univ, Fac Sci, Comp Sci Dept, MACS Lab, BP 11201, Meknes, Morocco
Keywords
Ribonucleic acid (RNA); RNA secondary structure; genetic algorithm; GRASP method; FOLDING ALGORITHM; WEB SERVER; PSEUDOKNOTS;
DOI
10.1142/S0219720019500318
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Predicting the optimal secondary structure of a given RNA sequence is a challenging computational problem in bioinformatics. The challenge has grown with the discovery of different classes of pseudoknots, complex topologies that play diverse roles in biological processes. Many recent methods have been proposed to predict RNA secondary structure including some pseudoknot classes, but only a few of them achieve satisfactory results in terms of both complexity and accuracy. Here we present RNAknot, a new method for predicting RNA secondary structures that contain the following components: stems, hairpin loops, multi-branched loops (multi-loops), bulge loops, and internal loops, in addition to two types of pseudoknots, the H-type pseudoknot and the kissing hairpin. RNAknot is based on a genetic algorithm and the Greedy Randomized Adaptive Search Procedure (GRASP), and it uses free energy as the fitness function to evaluate the structures it generates. To validate the performance of the method, 131 tests were performed using two datasets of 26 and 105 RNA sequences, taken from the databases RNAstrand and Pseudobase, respectively. The results are compared with those of several RNA secondary structure prediction programs: Vs_subopt, CyloFold, IPknot, Kinefold, RNAstructure, and Sfold. This comparative study shows that the prediction accuracy of the proposed approach is significantly improved relative to the other programs. For the first dataset, RNAknot has the highest average specificity (SP) (71.23%) and sensitivity (SN) (72.15%) among the compared programs. For the second dataset, the predictions obtained by RNAknot have the highest average SP (85.49%) and F-measure (79.97%) among the compared programs.
The program is available as a jar file at: www.bachmek.umi.ac.ma/wp-content/uploads/RNAknot .0.0 .2. rar.
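The abstract reports sensitivity (SN), specificity (SP), and F-measure without defining them. The sketch below shows one common convention in RNA structure prediction, assumed here and not confirmed by this record: SN = TP/(TP+FN), SP = TP/(TP+FP) (what is elsewhere called positive predictive value), and F-measure as their harmonic mean, all counted over base pairs. The helper names `base_pairs` and `accuracy` and the dot-bracket inputs are hypothetical and are not taken from RNAknot.

```python
def base_pairs(dot_bracket):
    """Return the set of (i, j) base pairs encoded in a dot-bracket string.

    Round and square brackets are matched independently, so a simple
    pseudoknot can be written with '[' and ']' crossing '(' and ')'.
    """
    closer_to_opener = {")": "(", "]": "["}
    stacks = {"(": [], "[": []}
    pairs = set()
    for j, ch in enumerate(dot_bracket):
        if ch in stacks:
            stacks[ch].append(j)
        elif ch in closer_to_opener:
            i = stacks[closer_to_opener[ch]].pop()
            pairs.add((i, j))
    return pairs


def accuracy(reference, predicted):
    """Return (SN, SP, F-measure) under the assumed base-pair convention."""
    ref = base_pairs(reference)
    pred = base_pairs(predicted)
    tp = len(ref & pred)   # pairs present in both structures
    fn = len(ref - pred)   # reference pairs the prediction missed
    fp = len(pred - ref)   # predicted pairs absent from the reference
    sn = tp / (tp + fn) if tp + fn else 0.0
    sp = tp / (tp + fp) if tp + fp else 0.0
    f = 2 * sn * sp / (sn + sp) if sn + sp else 0.0
    return sn, sp, f


if __name__ == "__main__":
    # Toy structures, not data from the paper.
    sn, sp, f = accuracy("(((...)))", "((.....))")
    print(f"SN={sn:.2%} SP={sp:.2%} F={f:.2%}")
```

Under this convention a prediction that omits true pairs lowers SN, while one that adds spurious pairs lowers SP; the F-measure balances the two.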
Pages: 19
Related papers
29 in total
[1]   RNA STRAND: The RNA secondary structure and statistical analysis database [J].
Andronescu, Mirela ;
Bereg, Vera ;
Hoos, Holger H. ;
Condon, Anne .
BMC BIOINFORMATICS, 2008, 9 (1)
[2]  
[Anonymous], URBANA
[3]  
Beg AH, 2016, C IND ELECT APPL, P2478, DOI 10.1109/ICIEA.2016.7604009
[4]   CyloFold: secondary structure prediction including pseudoknots [J].
Bindewald, Eckart ;
Kluth, Tanner ;
Shapiro, Bruce A. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W368-W372
[5]   Viral RNA pseudoknots: versatile motifs in gene expression and replication [J].
Brierley, Ian ;
Pennell, Simon ;
Gilbert, Robert J. C. .
NATURE REVIEWS MICROBIOLOGY, 2007, 5 (08) :598-610
[6]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[7]   Optimization of an RNA folding algorithm for parallel architectures [J].
Chen, JH ;
Le, SY ;
Shapiro, BA ;
Maizel, JV .
PARALLEL COMPUTING, 1998, 24 (11) :1617-1634
[8]  
Dawson W, 2014, J NUCL ACIDS INVEST, V5
[9]   GREEDY RANDOMIZED ADAPTIVE SEARCH PROCEDURES [J].
FEO, TA ;
RESENDE, MGC .
JOURNAL OF GLOBAL OPTIMIZATION, 1995, 6 (02) :109-133
[10]   FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES [J].
HOFACKER, IL ;
FONTANA, W ;
STADLER, PF ;
BONHOEFFER, LS ;
TACKER, M ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1994, 125 (02) :167-188