Prediction of DNA-binding propensity of proteins by the ball-histogram method using automatic template search

Cited by: 16
Authors
Szaboova, Andrea [1]
Kuzelka, Ondrej [1]
Zelezny, Filip [1]
Tolar, Jakub [2]
Affiliations
[1] Czech Tech Univ, Dept Cybernet, Prague 16627, Czech Republic
[2] Univ Minnesota, Dept Pediat, Minneapolis, MN 55455 USA
Keywords
SITES
DOI
10.1186/1471-2105-13-S10-S3
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
We contribute a novel ball-histogram approach to predicting the DNA-binding propensity of proteins. Unlike state-of-the-art methods, which construct an ad-hoc set of features describing physicochemical properties of the proteins, the ball-histogram technique enables a systematic Monte-Carlo exploration of the spatial distribution of amino acids complying with automatically selected properties. This exploration yields a model for predicting DNA-binding propensity. We validate our method in prediction experiments, improving on state-of-the-art accuracies. Moreover, our method provides interpretable features involving spatial distributions of selected amino acids.
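The abstract does not spell out the sampling procedure, so the following is only a minimal sketch of the general idea of a Monte-Carlo histogram over ball contents, not the authors' implementation. It assumes a protein is given as a list of (residue type, 3D coordinate) pairs, that ball centres are drawn uniformly from the residue positions, and that each ball is summarised by the set of residue types it contains. The function name `ball_histogram`, its parameters, and the toy coordinates are all hypothetical.

```python
import random
from collections import Counter


def ball_histogram(residues, radius, n_samples=1000, seed=0):
    """Estimate how often each set of residue types co-occurs inside a ball.

    residues: list of (residue_type, (x, y, z)) pairs (hypothetical input format).
    Returns a dict mapping frozenset of residue types -> relative frequency.
    """
    rng = random.Random(seed)
    counts = Counter()
    r2 = radius * radius
    for _ in range(n_samples):
        # Assumption: ball centres are sampled from the residue positions.
        _, (cx, cy, cz) = rng.choice(residues)
        # Collect the residue types whose coordinates fall inside the ball.
        inside = frozenset(
            kind
            for kind, (x, y, z) in residues
            if (x - cx) ** 2 + (y - cy) ** 2 + (z - cz) ** 2 <= r2
        )
        counts[inside] += 1
    # Normalise counts so the histogram sums to 1.
    return {key: c / n_samples for key, c in counts.items()}


# Toy structure: two nearby residues and one distant residue.
toy = [("ARG", (0.0, 0.0, 0.0)), ("LYS", (1.0, 0.0, 0.0)), ("GLY", (50.0, 0.0, 0.0))]
hist = ball_histogram(toy, radius=2.0)
```

In a sketch like this, the normalised histogram entries could serve as a fixed-length feature vector for a standard classifier; how the paper selects properties and builds its model is not stated in the abstract.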
Pages: 11