Cluster-based local modeling approach to protein secondary structure prediction

被引:0
|
作者
Doong, SH [1 ]
Yeh, CY [1 ]
机构
[1] ShuTe Univ, Dept Informat Management, Kaohsiung 824, Taiwan
关键词
protein; secondary structure prediction; support vector machine; clustering; local modeling; genetic algorithm;
D O I
10.1166/jctn.2005.010
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Protein secondary structure can be used to help determine the nanotechnologically relevant tertiary structure of a protein molecule via the fold recognition method. Starting from the primary amino acid sequence, protein secondary structure prediction (PSSP) has been widely studied using a large variety of algorithms. These include support vector machine (SVM) which has been successfully applied to many prediction problems, also PSSP. In this paper, we attack the PSSP problem from another perspective by using local modeling based on clustering. Most previous PSSP solutions improve the prediction accuracy by using more informative encoding schema, better prediction algorithms, and possibly finer methodology such as dual-layer classifiers or consensus voting mechanism. These approaches all adopted a global modeling technique to build a single classifier. Based on the successful applications of local modeling in many fields, we propose a hybrid approach to solve the PSSP problem by preprocessing the protein sequences with a genetic algorithm based clustering before building an individual SVM model for each cluster. Extensive analysis of several datasets of protein sequences and using statistical hypothesis testing, it seems preferable to cluster the sequence data before a classification step is performed in the PSSP problem. The improved prediction of protein secondary structure is important for advanced nanotechnology applications, like biomolecular machines using proteins as their components.
引用
收藏
页码:551 / 560
页数:10
相关论文
共 50 条
  • [1] Load balancing algorithm in cluster-based RNA secondary structure prediction
    Tan, GM
    Feng, SZ
    Sun, NH
    ISPDC 2005: 4th International Symposium on Parallel and Distributed Computing, 2005, : 91 - 96
  • [2] Modeling the hydrodynamics of downers by cluster-based approach
    Karimipour, Shayan
    Mostoufi, Navid
    Sotudeh-Gharebagh, Rahmat
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2006, 45 (21) : 7204 - 7209
  • [3] A SEGMENT-BASED APPROACH TO PROTEIN SECONDARY STRUCTURE PREDICTION
    PRESNELL, SR
    COHEN, BI
    COHEN, FE
    BIOCHEMISTRY, 1992, 31 (04) : 983 - 993
  • [4] A cluster-based ensemble approach for congenital heart disease prediction
    Kaur, Ishleen
    Ahmad, Tanvir
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [5] A knowledge-based approach to protein local structure prediction
    Chen, CT
    Lin, HN
    Wu, KP
    Sung, TY
    Hsu, WL
    PROCEEDINGS OF THE 4TH ASIA-PACIFIC BIOINFORMATICS CONFERENCE, 2006, 3 : 257 - 266
  • [6] Improving protein secondary structure prediction based on short subsequences with local structure similarity
    Lin, Hsin-Nan
    Sung, Ting-Yi
    Ho, Shinn-Ying
    Hsu, Wen-Lian
    BMC GENOMICS, 2010, 11
  • [7] Improving protein secondary structure prediction based on short subsequences with local structure similarity
    Hsin-Nan Lin
    Ting-Yi Sung
    Shinn-Ying Ho
    Wen-Lian Hsu
    BMC Genomics, 11
  • [8] Discovering cluster-based local outliers
    He, ZY
    Xu, XF
    Deng, SC
    PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1641 - 1650
  • [9] Combined approach to protein secondary structure prediction
    Amirova, S. R.
    Machavariani, M. A.
    Filatov, L., V
    Milchevsky, Ju. V.
    Esipova, N. G.
    Tumanyan, V. G.
    Proceedings of the Fourth International Conference on Bioinformatics of Genome Regulation and Structure, Vol 1, 2004, : 231 - 234
  • [10] An innovative cluster-based prediction approach for mass solar site management
    Wang, Jui-Tang
    Nguyen, Thi Anh Tuyet
    Guo, Yu-Hong
    Hsu, Chau-Yun
    Xie, Huang-Jun
    ENERGY & ENVIRONMENT, 2023,