Cluster-based local modeling approach to protein secondary structure prediction

被引:0
|
作者
Doong, SH [1 ]
Yeh, CY [1 ]
机构
[1] ShuTe Univ, Dept Informat Management, Kaohsiung 824, Taiwan
关键词
protein; secondary structure prediction; support vector machine; clustering; local modeling; genetic algorithm;
D O I
10.1166/jctn.2005.010
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Protein secondary structure can be used to help determine the nanotechnologically relevant tertiary structure of a protein molecule via the fold recognition method. Starting from the primary amino acid sequence, protein secondary structure prediction (PSSP) has been widely studied using a large variety of algorithms. These include support vector machine (SVM) which has been successfully applied to many prediction problems, also PSSP. In this paper, we attack the PSSP problem from another perspective by using local modeling based on clustering. Most previous PSSP solutions improve the prediction accuracy by using more informative encoding schema, better prediction algorithms, and possibly finer methodology such as dual-layer classifiers or consensus voting mechanism. These approaches all adopted a global modeling technique to build a single classifier. Based on the successful applications of local modeling in many fields, we propose a hybrid approach to solve the PSSP problem by preprocessing the protein sequences with a genetic algorithm based clustering before building an individual SVM model for each cluster. Extensive analysis of several datasets of protein sequences and using statistical hypothesis testing, it seems preferable to cluster the sequence data before a classification step is performed in the PSSP problem. The improved prediction of protein secondary structure is important for advanced nanotechnology applications, like biomolecular machines using proteins as their components.
引用
收藏
页码:551 / 560
页数:10
相关论文
共 50 条
  • [21] MACHINE LEARNING APPROACH FOR THE PREDICTION OF PROTEIN SECONDARY STRUCTURE
    KING, RD
    STERNBERG, MJE
    JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (02) : 441 - 457
  • [22] A Deep Learning Approach for Prediction of Protein Secondary Structure
    Zubair, Muhammad
    Hanif, Muhammad Kashif
    Alabdulkreem, Eatedal
    Ghadi, Yazeed
    Khan, Muhammad Irfan
    Sarwar, Muhammad Umer
    Hanif, Ayesha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3705 - 3718
  • [23] Cluster-Based Prediction for Batteries in Data Centers
    Haider, Syed Naeem
    Zhao, Qianchuan
    Li, Xueliang
    ENERGIES, 2020, 13 (05)
  • [24] An empirical study of query expansion and cluster-based retrieval in language modeling approach
    Na, Seung-Hoon
    Kang, In-Su
    Roh, Ji-Eun
    Lee, Jong-Hyeok
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (02) : 302 - 314
  • [25] Modeling of a Multizone Gas-Phase Polyethylene Reactor with a Cluster-Based Approach
    Adli, H.
    Mostoufi, N.
    Ghafelebashi, S. M.
    JOURNAL OF APPLIED POLYMER SCIENCE, 2011, 122 (01) : 393 - 405
  • [26] An empirical study of query expansion and cluster-based retrieval in language modeling approach
    Na, SH
    Kang, IS
    Roh, JE
    Lee, JH
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 274 - 287
  • [27] Cluster-based region formation for modeling MANET
    Suoto, Henry
    Sen, Subhabrata Bapi
    2007 IEEE MILITARY COMMUNICATIONS CONFERENCE, VOLS 1-8, 2007, : 1729 - 1735
  • [28] SPOKEN LANGUAGE RECOGNITION WITH CLUSTER-BASED MODELING
    Kacprzak, Stanislaw
    Rybicka, Magdalena
    Kowalczyk, Konrad
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6867 - 6871
  • [29] In silico prediction of protein flexibility with local structure approach
    Narwani, Tarun J.
    Etchebest, Catherine
    Craveur, Pierrick
    Leonard, Sylvain
    Rebehmed, Joseph
    Srinivasan, Narayanaswamy
    Bornot, Aurelie
    Gelly, Jean-Christophe
    de Brevern, Alexandre G.
    BIOCHIMIE, 2019, 165 : 150 - 155
  • [30] HYPROSP II - A knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence
    Lin, HN
    Chang, JM
    Wu, KP
    Sung, TY
    Hsu, WL
    BIOINFORMATICS, 2005, 21 (15) : 3227 - 3233