Prediction of protein structural classes using support vector machines

被引:134
作者
Sun, X. -D. [1 ]
Huang, R. -B. [1 ]
机构
[1] Guangxi Univ, Coll Life Sci & Biotechnol, Nanning 530004, Guangxi, Peoples R China
关键词
support vector machines; CATH; multi-class; protein structural class prediction; jackknifing;
D O I
10.1007/s00726-005-0239-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The support vector machine, a machine-learning method, is used to predict the four structural classes, i.e. mainly alpha, mainly beta, alpha-beta and fss, from the topology-level of CATH protein structure database. For the binary classification, any two structural classes which do not share any secondary structure such as alpha and beta elements could be classified with as high as 90% accuracy. The accuracy, however, will decrease to less than 70% if the structural classes to be classified contain structure elements in common. Our study also shows that the dimensions of feature space 20(2) = 400 (for dipeptide) and 20(3) = 8 000 (for tripeptide) give nearly the same prediction accuracy. Among these 4 structural classes, multi-class classification gives an overall accuracy of about 52%, indicating that the multi-class classification technique in support of vector machines may still need to be further improved in future investigation.
引用
收藏
页码:469 / 475
页数:7
相关论文
共 50 条
[21]   Nonlinear Structural Damage Detection using Support Vector Machines [J].
Xiao, Li ;
Qu, Wenzhong .
HEALTH MONITORING OF STRUCTURAL AND BIOLOGICAL SYSTEMS 2012, 2012, 8348
[22]   Using support vector machines for mining regression classes in large data sets [J].
Sun, ZH ;
Gao, LX ;
Sun, YX .
2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, :89-92
[23]   Prediction of Protein Secondary Structure Based on NMR Chemical Shift Data Using Support Vector Machines [J].
Sabouri, Ahmad ;
Ardalan, Adel ;
Shahidi-Nejad, Reza .
2010 12TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2010, :201-205
[24]   Prediction of flow rate of karstic springs using support vector machines [J].
Goyal, Manish Kumar ;
Sharma, Ashutosh ;
Katsifarakis, Konstantinos L. .
HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2017, 62 (13) :2175-2186
[25]   Substructure Prediction from Infrared Spectra by Using Support Vector Machines [J].
Jun Hong LIU Min Chun LU Fu Sheng NIE Xiao Yu FENG Meng Long LI College of Chemistry Sichuan University Chengdu .
ChineseChemicalLetters, 2005, (10) :1354-1356
[26]   Prediction of Drosophila melanogaster gene function using Support Vector Machines [J].
Mitsakakis, Nicholas ;
Razak, Zak ;
Escobar, Michael ;
Westwood, J. Timothy .
BIODATA MINING, 2013, 6
[27]   Prediction of significant wave height using regressive support vector machines [J].
Mahjoobi, J. ;
Mosabbeb, Ehsan Adeli .
OCEAN ENGINEERING, 2009, 36 (05) :339-347
[28]   Substructure prediction from infrared spectra by using support vector machines [J].
Liu, JH ;
Lu, MC ;
Nie, FS ;
Feng, XY ;
Li, ML .
CHINESE CHEMICAL LETTERS, 2005, 16 (10) :1354-1356
[29]   Prediction of Wool Knitwear Pilling Propensity using Support Vector Machines [J].
Yap, Poh Hean ;
Wang, Xungai ;
Wang, Lijing ;
Ong, Kok-Leong .
TEXTILE RESEARCH JOURNAL, 2010, 80 (01) :77-83
[30]   Prediction of bankruptcy using support vector machines: an application to bank bankruptcy [J].
Erdogan, Birsen Eygi .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2013, 83 (08) :1543-1555