PARTS: Probabilistic Alignment for RNA joinT Secondary structure prediction

Cited by: 30
Authors
Harmanci, Arif Ozgun [1 ]
Sharma, Gaurav [1 ,2 ]
Mathews, David H. [2 ,3 ]
Affiliations
[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
[2] Univ Rochester, Med Ctr, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
[3] Univ Rochester, Med Ctr, Dept Biochem & Biophys, Rochester, NY 14642 USA
Funding
US National Institutes of Health;
DOI
10.1093/nar/gkn043
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology];
Discipline classification code
071010 ; 081704 ;
Abstract
A novel method is presented for joint prediction of alignment and common secondary structures of two RNA sequences. The joint consideration of common secondary structures and alignment is accomplished by structural alignment over a search space defined by the newly introduced motif called matched helical regions. The matched helical region formulation generalizes previously employed constraints for structural alignment and thereby better accommodates the structural variability within RNA families. A probabilistic model based on pseudo free energies obtained from precomputed base pairing and alignment probabilities is utilized for scoring structural alignments. Maximum a posteriori (MAP) common secondary structures, sequence alignment and joint posterior probabilities of base pairing are obtained from the model via a dynamic programming algorithm called PARTS. The advantage of the more general structural alignment of PARTS is seen in secondary structure predictions for the RNase P family. For this family, the PARTS MAP predictions of secondary structures and alignment perform significantly better than prior methods that utilize a more restrictive structural alignment model. For the tRNA and 5S rRNA families, the richer structural alignment model of PARTS does not offer a benefit and the method therefore performs comparably with existing alternatives. For all RNA families studied, the posterior probability estimates obtained from PARTS offer an improvement over posterior probability estimates from a single sequence prediction. When considering the base pairings predicted over a threshold value of confidence, the combination of sensitivity and positive predictive value is better for PARTS than for single-sequence prediction. PARTS source code is available for download under the GNU public license at http://rna.urmc.rochester.edu.
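The evaluation mentioned at the end of the abstract (base pairs predicted above a confidence threshold, scored by sensitivity and positive predictive value) can be illustrated with a minimal Python sketch. This is not the PARTS code: the function names, the toy probabilities and the known structure below are all made up for illustration, and the sketch assumes a predicted pair counts as correct only if it matches a known pair exactly.

```python
from typing import Dict, Set, Tuple

Pair = Tuple[int, int]  # (i, j) nucleotide indices of a base pair, i < j


def threshold_pairs(pair_probs: Dict[Pair, float], threshold: float) -> Set[Pair]:
    """Keep the pairs whose estimated pairing probability is >= threshold."""
    return {pair for pair, p in pair_probs.items() if p >= threshold}


def sensitivity_ppv(predicted: Set[Pair], known: Set[Pair]) -> Tuple[float, float]:
    """Sensitivity = TP / |known pairs|; PPV = TP / |predicted pairs|."""
    true_pos = len(predicted & known)  # exact matches only (assumption)
    sensitivity = true_pos / len(known) if known else 0.0
    ppv = true_pos / len(predicted) if predicted else 0.0
    return sensitivity, ppv


# Hypothetical toy data, not taken from the paper.
pair_probs = {(1, 20): 0.95, (2, 19): 0.90, (3, 18): 0.40, (5, 12): 0.75}
known = {(1, 20), (2, 19), (3, 18)}

predicted = threshold_pairs(pair_probs, threshold=0.7)
sens, ppv = sensitivity_ppv(predicted, known)
print(f"predicted={sorted(predicted)} sensitivity={sens:.2f} PPV={ppv:.2f}")
```

Raising the threshold typically trades sensitivity for positive predictive value, which is why the abstract compares the two quantities jointly rather than either one alone.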
Pages: 2406-2417
Page count: 12
Related papers
50 in total
  • [1] JOINT STOCHASTIC SAMPLING FOR RNA SECONDARY STRUCTURE PREDICTION
    Harmanci, Arif Ozgun
    Sharma, Gaurav
    Mathews, David H.
    2009 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2009), 2009, : 65 - +
  • [2] Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
    Harmanci, Arif Ozgun
    Sharma, Gaurav
    Mathews, David H.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [3] Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
    Arif Ozgun Harmanci
    Gaurav Sharma
    David H Mathews
    BMC Bioinformatics, 8
  • [4] StatAlign 2.0: combining statistical alignment with RNA secondary structure prediction
    Arunapuram, Preeti
    Edvardsson, Ingolfur
    Golden, Michael
    Anderson, James W. J.
    Novak, Adam
    Sukosd, Zsuzsanna
    Hein, Jotun
    BIOINFORMATICS, 2013, 29 (05) : 654 - 655
  • [5] Data-directed RNA secondary structure prediction using probabilistic modeling
    Deng, Fei
    Ledda, Mirko
    Vaziri, Sana
    Aviran, Sharon
    RNA, 2016, 22 (08) : 1109 - 1119
  • [6] ProbFold: a probabilistic method for integration of probing data in RNA secondary structure prediction
    Sahoo, Sudhakar
    Switnicki, Michal P.
    Pedersen, Jakob Skou
    BIOINFORMATICS, 2016, 32 (17) : 2626 - 2635
  • [7] PREDICTION OF RNA SECONDARY STRUCTURE
    DELISI, C
    CROTHERS, DM
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1971, 68 (11) : 2682 - &
  • [8] RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment
    Xu, Xing
    Ji, Yongmei
    Stormo, Gary D.
    BIOINFORMATICS, 2007, 23 (15) : 1883 - 1891
  • [9] TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs
    Tan, Zhen
    Fu, Yinghan
    Sharma, Gaurav
    Mathews, David H.
    NUCLEIC ACIDS RESEARCH, 2017, 45 (20) : 11570 - 11581
  • [10] Probabilistic methods for improving efficiency of RNA secondary structure prediction across multiple sequences
    Sharma, Gaurav
    Harmanci, A. Ozgun
    Mathews, David H.
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 34 - +