Accurate Prediction of Protein Hot Spots Residues Based on Gentle AdaBoost Algorithm

被引:1
作者
Sun, Zhen [1 ]
Zhang, Jun [1 ]
Zheng, Chun-Hou [1 ]
Wang, Bing [2 ]
Chen, Peng [3 ]
机构
[1] Anhui Univ, Sch Elect Engn & Automat, 111 Jiulong Rd, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R China
[3] Anhui Univ, Inst Hlth Sci, Hefei 230601, Anhui, Peoples R China
来源
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT I | 2016年 / 9771卷
关键词
Hot spots; Gentle AdaBoost; Protein interaction; SOLVENT ACCESSIBILITY; INTERFACES; RECOGNITION; PRINCIPLES; DATABASE; BINDING;
D O I
10.1007/978-3-319-42291-6_74
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hot spots are critical for protein interactions. Since the experimental method to measure protein hot spots are very cost and time-consuming, computational method is an option to predict hot spots in protein interfaces. In this work, we use Gentle AdaBoost to identify hot spot without feature selection. For all algorithm, ASEdb and BID are used as separate training and test dataset. We extract sequential and structural information of protein, which constitutes 178 dimensional feature vectors. Comparing with other algorithms, our proposed algorithm obtained satisfactory experimental results either on training dataset or test dataset, which yeilds F1 score of 0.79 and 0.69 on training dataset and test dataset, respectively.
引用
收藏
页码:742 / 749
页数:8
相关论文
共 17 条
[1]   Anatomy of hot spots in protein interfaces [J].
Bogan, AA ;
Thorn, KS .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 280 (01) :1-9
[2]  
Breiman F, 1984, OLSHEN STONE CLASSIF
[3]   A feature-based approach to modeling proteinprotein interaction hot spots [J].
Cho, Kyu-il ;
Kim, Dongsup ;
Lee, Doheon .
NUCLEIC ACIDS RESEARCH, 2009, 37 (08) :2672-2687
[4]   PRINCIPLES OF PROTEIN-PROTEIN RECOGNITION [J].
CHOTHIA, C ;
JANIN, J .
NATURE, 1975, 256 (5520) :705-708
[5]   An automated decision-tree approach to predicting protein interaction hot spots [J].
Darnell, Steven J. ;
Page, David ;
Mitchell, Julie C. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 68 (04) :813-823
[6]  
Demirkir C, 2005, LECT NOTES COMPUT SC, V3546, P339
[7]   The binding interface database (BID): a compilation of amino acid hot spots in protein interfaces [J].
Fischer, TB ;
Arunachalam, KV ;
Bailey, D ;
Mangual, V ;
Bakhru, S ;
Russo, R ;
Huang, D ;
Paczkowski, M ;
Lalchandani, V ;
Ramachandra, C ;
Ellison, B ;
Galer, S ;
Shapley, J ;
Fuentes, E ;
Tsai, J .
BIOINFORMATICS, 2003, 19 (11) :1453-1454
[8]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[9]  
JANIN J, 1995, BIOCHIMIE, V77, P497, DOI 10.1016/0300-9084(96)88166-1
[10]   Protein interactions and disease: computational approaches to uncover the etiology of diseases [J].
Kann, Maricel G. .
BRIEFINGS IN BIOINFORMATICS, 2007, 8 (05) :333-346