A two-layer classification framework for protein fold recognition

被引:13
作者
Aram, Reza Zohouri [1 ]
Charkari, Nasrollah Moghadam [1 ]
机构
[1] Univ Tarbiat Modares, Fac Elect & Comp Engn, Tehran, Iran
关键词
Supervised learning; Ensemble classifiers; Fusion system; AMINO-ACID-COMPOSITION; ENSEMBLE CLASSIFIER; SEQUENCE; PREDICTION; NETWORKS;
D O I
10.1016/j.jtbi.2014.09.032
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein fold recognition is one of the interesting studies in bioinformatic to predicting the tertiary structure of proteins. In this paper, an individual method and a fusion method are proposed for protein fold recognition. A Two Layer Classification Framework (TLCF) is proposed as individual method. This framework comprises of two layers: in the first layer, the structural class of protein is predicted. The classifier in this layer classifies the instances into four structural classes: all alpha, all beta, alpha/beta, and alpha+beta. Then, the classification results will be added as a new feature to further training and testing datasets. Using the results of the first layer, we employ another classifier for predicting 27 folding classes in the second layer. The results indicate that the proposed approach is very effective to improve the prediction accuracy where the measured values of MCC, specificity, and sensitivity are promising. TLCF* is proposed as a fusion method that exploits TLCF as a base model. The experimental results indicate that the proposed methods improve prediction accuracy by 2-10% on a benchmark dataset. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 39
页数:8
相关论文
共 42 条
  • [1] FRAN and RBF-PSO as two components of a hyper framework to recognize protein folds
    Abbasi, Elham
    Ghatee, Mehdi
    Shiri, M. E.
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (09) : 1182 - 1191
  • [2] iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition
    Chen, Wei
    Feng, Peng-Mian
    Lin, Hao
    Chou, Kuo-Chen
    [J]. BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [3] iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
    Chen, Wei
    Feng, Peng-Mian
    Lin, Hao
    Chou, Kuo-Chen
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (06) : e68
  • [4] Ensemble voting system for multiclass protein fold recognition
    Chen, Yuehui
    Chen, Feng
    Yang, Jack Y.
    Yang, Mary Qu
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2008, 22 (04) : 747 - 763
  • [5] Cheng B., 1994, NEUR NETW REV STATIS, P2
  • [6] A hybrid discriminative/generative approach to protein fold recognition
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    [J]. NEUROCOMPUTING, 2012, 75 (01) : 194 - 198
  • [7] Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
    Chou, KC
    [J]. BIOINFORMATICS, 2005, 21 (01) : 10 - 19
  • [8] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [9] Some remarks on protein attribute prediction and pseudo amino acid composition
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) : 236 - 247
  • [10] Chou Kuo-Chen., 2009, Natural Science, V1, P63