SARNA-Ensemble-Predict: The Effect of Different Dissimilarity Metrics on a Novel Ensemble-based RNA Secondary Structure Prediction Algorithm

被引:3
作者
Tsang, Herbert H. [1 ]
Wiese, Kay C. [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Surrey, BC V3T 2W1, Canada
来源
CIBCB: 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY | 2009年
关键词
CLASSIFICATION; RNAPREDICT;
D O I
10.1109/CIBCB.2009.4925701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there is a resurgence of interest in the RNA secondary structure prediction problem due to the discovery of many new families of non-coding RNAs with a variety of functions. This paper describes and presents a novel algorithm for RNA secondary structure prediction based on an ensemble-based approach. An evaluation of the performance in terms of sensitivity and specificity is made. Experiments were performed on eleven structures from four RNA classes (RNaseP, Group I intron 16S rRNA, Group I intron 23S rRNA and 16S rRNA). Three RNA secondary structure similarity metrics (base pair distance, tree edit distance, and thermodynamic energy distance) and their effects on the clustering algorithm were explored. The significant contribution of this paper is in the examining of the various results from employing different dissimilarity metrics. Overall, the base pair distance dissimilarity metric shows better results with the other two distance metrics (tree edit distance and thermodynamic energy distance). The results presented in this paper demonstrate that SARNA-Ensemble-Predict can give comparable performance to a state-of-the-art algorithm Sfold in terms of sensitivity.
引用
收藏
页码:8 / 15
页数:8
相关论文
共 33 条
  • [1] Azencott R., 1992, Simulated Annealing, VVolume 27
  • [2] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [3] European policewomen; A comparative research perspective
    Brown, J
    [J]. INTERNATIONAL JOURNAL OF THE SOCIOLOGY OF LAW, 1997, 25 (01): : 1 - 19
  • [4] Cannone Jamie J., 2002, BMC Bioinformatics, V3, P1
  • [5] Structure clustering features on the Sfold Web server
    Chan, CY
    Lawrence, CE
    Ding, Y
    [J]. BIOINFORMATICS, 2005, 21 (20) : 3926 - 3928
  • [6] RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble
    Ding, Y
    Chan, CY
    Lawrence, CE
    [J]. RNA, 2005, 11 (08) : 1157 - 1166
  • [7] A statistical sampling algorithm for RNA secondary structure prediction
    Ding, Y
    Lawrence, CE
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7280 - 7301
  • [8] STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES
    GEMAN, S
    GEMAN, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (06) : 721 - 741
  • [9] COOLING SCHEDULES FOR OPTIMAL ANNEALING
    HAJEK, B
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 1988, 13 (02) : 311 - 329
  • [10] Vienna RNA secondary structure server
    Hofacker, IL
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3429 - 3431