共 20 条
Improved method for predicting protein fold patterns with ensemble classifiers
被引:22
作者:
Chen, W.
[1
]
Liu, X.
[1
,3
,4
]
Huang, Y.
[2
]
Jiang, Y.
[1
]
Zou, Q.
[1
]
Lin, C.
[1
]
机构:
[1] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Fujian, Peoples R China
[2] Henan Univ Sci & Technol, Anim Sci & Technol Coll, Luoyang, Henan, Peoples R China
[3] Xiamen Univ, Shenzhen Res Inst, Guangzhou, Guangdong, Peoples R China
[4] Dalian Univ, Minist Educ, Key Lab Adv Design & Intelligent Comp, Dalian, Peoples R China
关键词:
Protein folding pattern;
Ensemble classifier;
Machine learning;
Bioinformatics;
CLASSIFICATION;
DATABASE;
D O I:
10.4238/2012.January.27.4
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/similar to cwc/ProteinPredict.html.
引用
收藏
页码:174 / 181
页数:8
相关论文