A two-layer classification framework for protein fold recognition

被引:13
作者
Aram, Reza Zohouri [1 ]
Charkari, Nasrollah Moghadam [1 ]
机构
[1] Univ Tarbiat Modares, Fac Elect & Comp Engn, Tehran, Iran
关键词
Supervised learning; Ensemble classifiers; Fusion system; AMINO-ACID-COMPOSITION; ENSEMBLE CLASSIFIER; SEQUENCE; PREDICTION; NETWORKS;
D O I
10.1016/j.jtbi.2014.09.032
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein fold recognition is one of the interesting studies in bioinformatic to predicting the tertiary structure of proteins. In this paper, an individual method and a fusion method are proposed for protein fold recognition. A Two Layer Classification Framework (TLCF) is proposed as individual method. This framework comprises of two layers: in the first layer, the structural class of protein is predicted. The classifier in this layer classifies the instances into four structural classes: all alpha, all beta, alpha/beta, and alpha+beta. Then, the classification results will be added as a new feature to further training and testing datasets. Using the results of the first layer, we employ another classifier for predicting 27 folding classes in the second layer. The results indicate that the proposed approach is very effective to improve the prediction accuracy where the measured values of MCC, specificity, and sensitivity are promising. TLCF* is proposed as a fusion method that exploits TLCF as a base model. The experimental results indicate that the proposed methods improve prediction accuracy by 2-10% on a benchmark dataset. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 39
页数:8
相关论文
共 42 条
[1]   FRAN and RBF-PSO as two components of a hyper framework to recognize protein folds [J].
Abbasi, Elham ;
Ghatee, Mehdi ;
Shiri, M. E. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (09) :1182-1191
[2]   iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
BIOMED RESEARCH INTERNATIONAL, 2014, 2014
[3]   iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e68
[4]   Ensemble voting system for multiclass protein fold recognition [J].
Chen, Yuehui ;
Chen, Feng ;
Yang, Jack Y. ;
Yang, Mary Qu .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2008, 22 (04) :747-763
[5]  
Cheng B., 1994, NEUR NETW REV STATIS, P2
[6]   A hybrid discriminative/generative approach to protein fold recognition [J].
Chmielnicki, Wieslaw ;
Stapor, Katarzyna .
NEUROCOMPUTING, 2012, 75 (01) :194-198
[7]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[8]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[9]   Some remarks on protein attribute prediction and pseudo amino acid composition [J].
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) :236-247
[10]  
Chou Kuo-Chen., 2009, Natural Science, V1, P63