Consensus RNA secondary structure prediction using information of neighbouring columns and principal component analysis

被引:0
作者
Liu, Tianhang [1 ]
Yin, Jianping [1 ]
Gao, Long [1 ]
Chen, Wei [1 ]
Qiu, Minghui [2 ]
机构
[1] Natl Univ Def Technol, Comp Sch, Changsha, Hunan, Peoples R China
[2] Chinese Peoples Liberat Army Gen Hosp, Med Informat Inst, Beijing, Peoples R China
关键词
RNA secondary structure prediction; comparative sequence analysis; principal component analysis; PCA; information of neighbouring columns;
D O I
10.1504/ijcse.2019.10022734
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
RNA is a family of biological macromolecules. It is important to all kinds of biological processes. RNA structure is closely related to its functions. Hence, determining the structure is invaluable in understanding genetic diseases and creating drugs. Nowadays, RNA secondary structure prediction is a field yet to be researched. In this paper, we present a novel method using RNA sequence alignment to predict a consensus RNA secondary structure. In essence, the goal of the method is to give a prediction about whether any two columns of an alignment correspond to a base pair or not, using the information provided by the alignment. The information includes the covariation score, the fraction of complementary nucleotides and the consensus probability matrix of the column pair and those of its neighbours. Then principal component analysis is applied to overcome the problem of over-fitting. A comparison of our method and other consensus RNA secondary structure prediction methods including NeCFold, ELMFold, KnetFold, PFold and RNAalifold, in 47 families from Rfam (version 11.0) is performed. Results show that our method surpasses the other methods in terms of Matthews correlation coefficient, sensitivity and selectivity.
引用
收藏
页码:430 / 439
页数:10
相关论文
共 29 条
[1]   Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine [J].
Batey, RT ;
Gilbert, SD ;
Montange, RK .
NATURE, 2004, 432 (7015) :411-415
[2]   RNAalifold: improved consensus structure prediction for RNA alignments [J].
Bernhart, Stephan H. ;
Hofacker, Ivo L. ;
Will, Sebastian ;
Gruber, Andreas R. ;
Stadler, Peter F. .
BMC BIOINFORMATICS, 2008, 9 (1)
[3]   RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers [J].
Bindewald, E ;
Shapiro, BA .
RNA, 2006, 12 (03) :342-352
[4]   Extreme Learning Machines [J].
Cambria, Erik ;
Huang, Guang-Bin .
IEEE INTELLIGENT SYSTEMS, 2013, 28 (06) :30-31
[5]   The role of tRNA and ribosome competition in coupling the expression of different mRNAs in Saccharomyces cerevisiae [J].
Chu, Dominique ;
Barnes, David J. ;
von der Haar, Tobias .
NUCLEIC ACIDS RESEARCH, 2011, 39 (15) :6705-6714
[6]   Algorithmic skeletons for multi-core, multi-GPU systems and clusters [J].
Ernsting, Steffen ;
Kuchen, Herbert .
International Journal of High Performance Computing and Networking, 2012, 7 (02) :129-138
[7]   NMR spectroscopy: a multifaceted approach to macromolecular structure [J].
Ferentz, AE ;
Wagner, G .
QUARTERLY REVIEWS OF BIOPHYSICS, 2000, 33 (01) :29-65
[8]   Secondary structure is required for 3′ splice site recognition in yeast [J].
Gahura, Ondrej ;
Hammann, Christian ;
Valentova, Anna ;
Puta, Frantisek ;
Folk, Petr .
NUCLEIC ACIDS RESEARCH, 2011, 39 (22) :9759-9767
[9]   A comprehensive comparison of comparative RNA structure prediction approaches [J].
Gardner, PP ;
Giegerich, R .
BMC BIOINFORMATICS, 2004, 5 (1)
[10]   Secondary structure prediction for aligned RNA sequences [J].
Hofacker, IL ;
Fekete, M ;
Stadler, PF .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 319 (05) :1059-1066