HYBRID SYSTEM FOR PROTEIN SECONDARY STRUCTURE PREDICTION

被引:118
|
作者
ZHANG, X
MESIROV, JP
WALTZ, DL
机构
[1] Thinking Machines Corporation, Cambridge, MA 02142
关键词
PROTEIN SECONDARY STRUCTURE PREDICTION; HYBRID SYSTEM; NEURAL NETWORKS; MEMORY-BASED REASONING; STATISTICAL METHODS;
D O I
10.1016/0022-2836(92)90104-R
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have developed a hybrid system to predict the secondary structures (α-helix, β-sheet and coil) of proteins and achieved 66.4% accuracy, with correlation coefficients of Ccoil = 0.429, Cα = 0.470 and Cβ = 0.387. This system contains three subsystems ("experts"): a neural network module, a statistical module and a memory-based reasoning module. First, the three experts independently learn the mapping between amino acid sequences and secondary structures from the known protein structures, then a Combiner learns to combine automatically the outputs of the experts to make final predictions. The hybrid system was tested with 107 protein structures through k-way cross-validation. Its performance was better than each expert and all previously reported methods with greater than 0.99 statistical significance. It was observed that for 20% of the residues, all three experts produced the same but wrong predictions. This may suggest an upper bound on the accuracy of secondary structure predictions based on local information from the currently available protein structures, and indicate places where non-local interactions may play a dominant role in conformation. For 64% of the residues, at least two experts were the same and correct, which shows that the Combiner performed better than majority vote. For 77 % of the residues, at least one expert was correct, thus there may still be room for improvement in this hybrid approach. Rigorous evaluation procedures were used in testing the hybrid system, and statistical significance measures were developed in analyzing the differences among different methods. When measured in terms of the number of secondary structures (rather than the number of residues) that were predicted correctly, the prediction produced by the hybrid system was also better than those of individual experts. © 1992.
引用
收藏
页码:1049 / 1063
页数:15
相关论文
共 50 条
  • [1] Hybrid SVM kernels for protein secondary structure prediction
    Altun, Gulsah
    Hu, Hae-Jin
    Brinza, Dumitru
    Harrison, Robert W.
    Zelikovsky, Alex
    Pan, Yi
    2006 IEEE International Conference on Granular Computing, 2006, : 762 - 765
  • [2] A Hybrid Method for Prediction of Protein Secondary Structure Based on Multiple Artificial Neural Networks
    Hasic, Haris
    Buza, Emir
    Akagic, Amila
    2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1195 - 1200
  • [3] New Trend of Protein Secondary Structure Prediction
    Zhai, Yun
    Yang, Bingru
    Wang, Lijun
    An, Bing
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 121 - +
  • [4] A new hybrid coding for protein secondary structure prediction based on primary structure similarity
    Li, Zhong
    Wang, Jing
    Zhang, Shunpu
    Zhang, Qifeng
    Wu, Wuming
    GENE, 2017, 618 : 8 - 13
  • [5] The Research of Protein Secondary Structure Prediction System Based on KDTICM
    Yang, Bingru
    Hou, Wei
    Xie, Yonghong
    Wang, Lijun
    WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 47 - 51
  • [6] Protein secondary structure prediction: A survey of the state of the art
    Jiang, Qian
    Jin, Xin
    Lee, Shin-Jye
    Yao, Shaowen
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2017, 76 : 379 - 402
  • [7] A Hybrid Genetic-Neural System for Predicting Protein Secondary Structure
    Giuliano Armano
    Gianmaria Mancosu
    Luciano Milanesi
    Alessandro Orro
    Massimiliano Saba
    Eloisa Vargiu
    BMC Bioinformatics, 6
  • [8] Parallelized protein secondary structure prediction
    Qi, YT
    Lin, F
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2074 - 2077
  • [9] Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction
    S. Saraswathi
    J. L. Fernández-Martínez
    A. Kolinski
    R. L. Jernigan
    A. Kloczkowski
    Journal of Molecular Modeling, 2012, 18 : 4275 - 4289
  • [10] Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction
    Saraswathi, S.
    Fernandez-Martinez, J. L.
    Kolinski, A.
    Jernigan, R. L.
    Kloczkowski, A.
    JOURNAL OF MOLECULAR MODELING, 2012, 18 (09) : 4275 - 4289