Identifying protein quaternary structural attributes by incorporating physicochemical properties into the general form of Chou's PseAAC via discrete wavelet transform

Cited by: 64
Authors
Sun, Xing-Yu [1]
Shi, Shao-Ping [1]
Qiu, Jian-Ding [1, 2]
Suo, Sheng-Bao [1]
Huang, Shu-Yun [1]
Liang, Ru-Ping [1]
Affiliations
[1] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[2] Pingxiang Coll, Dept Mat & Chem Engn, Pingxiang 337055, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; PHOSPHOLAMBAN PENTAMER; SUBCELLULAR LOCATION; FUNCTIONAL DOMAIN; PREDICTION; CHANNEL; CLASSIFICATION; EVOLUTION;
DOI
10.1039/c2mb25280e
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
In vivo, some proteins exist as monomers and others as oligomers. Oligomers can be further classified into homo-oligomers (formed by identical subunits) and hetero-oligomers (formed by different subunits), and they provide the structural basis for various biological functions, including cooperative effects, allosteric mechanisms and ion-channel gating. Therefore, with the avalanche of protein sequences generated in the post-genomic era, it is important for both basic research and the pharmaceutical industry to determine the quaternary structural attributes of proteins of interest. To this end, a high-throughput method (DWT_DT) has been developed to predict protein quaternary structures: a two-layer approach that fuses the discrete wavelet transform (DWT) and a decision-tree algorithm (DT) with physicochemical features. The first layer assigns a query protein to one of the 10 main quaternary structural attributes. The second layer evaluates whether the protein in question is a homo-oligomer or a hetero-oligomer. The overall accuracy of the first-layer identification by the jackknife test was 89.60%. The overall accuracy of the second layer ranges from 88.23% to 100%. These results suggest that the newly developed protocol (DWT_DT) is very promising for predicting quaternary structures with complicated composition.
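As a rough illustration of the feature-extraction idea in the abstract, the sketch below maps a sequence to a numeric signal with a physicochemical scale, decomposes it with a discrete wavelet transform, and summarizes each sub-band into a fixed-length vector that a decision tree could consume. The abstract does not name the wavelet family, the decomposition level, the physicochemical scales or the summary statistics, so the Haar wavelet, three levels, the Kyte-Doolittle hydropathy index and the max/min/mean/standard-deviation summary are all assumptions made here for illustration. The function names `haar_step`, `summarize` and `dwt_features` are hypothetical.

```python
import math

# Kyte-Doolittle hydropathy index. Assumption: used here only as one
# well-known physicochemical scale; the abstract does not list the scales.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def haar_step(signal):
    """One level of the orthonormal Haar DWT.

    Odd-length input is padded by repeating its last value.
    Returns (approximation, detail) coefficient lists.
    """
    if len(signal) % 2:
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def summarize(coeffs):
    """Max, min, mean and population standard deviation of a sub-band."""
    n = len(coeffs)
    mean = sum(coeffs) / n
    var = sum((c - mean) ** 2 for c in coeffs) / n
    return [max(coeffs), min(coeffs), mean, math.sqrt(var)]


def dwt_features(sequence, levels=3):
    """Fixed-length feature vector of size 4 * (levels + 1).

    Raises ValueError on an empty sequence and KeyError on residues
    outside the 20 standard amino acids.
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    approx = [KD[aa] for aa in sequence.upper()]
    features = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        features.extend(summarize(detail))   # one detail band per level
    features.extend(summarize(approx))       # final approximation band
    return features


if __name__ == "__main__":
    # Toy sequence, not taken from the paper's dataset.
    vec = dwt_features("MKVLAAGIVG", levels=3)
    print(len(vec), vec[:4])
```

Because every sub-band is reduced to the same four statistics, proteins of different lengths yield vectors of equal size, which is what makes them usable as input to a standard classifier. In a two-layer setup like the one described, the same vectors would presumably feed both the first-layer and the second-layer classifiers, though the abstract does not spell this out.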
Pages: 3178-3184
Page count: 7