Meta prediction of protein crystallization propensity

被引:27
作者
Mizianty, Marcin J. [1 ]
Kurgan, Lukasz [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, ECERF, Edmonton, AB T6G 2V4, Canada
关键词
Structural genomics; X-ray crystallography; Crystallization propensity prediction; Protein structure; Protein crystallization; STRUCTURAL GENOMICS; ISOELECTRIC POINT; WEB SERVER; DESIGN; INFORMATION; STRATEGIES; SCALE;
D O I
10.1016/j.bbrc.2009.09.036
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Production of high-quality crystals is one of the main bottlenecks in the X-ray crystallography driven protein structure determination. Availability of structure determination data repositories, such as TargetDB and PepcDB, and flexibility in target selection in structural genomics motivate development of methods that predict crystallization propensity from a given protein sequence. We introduce a novel linear model tree-based meta-predictor, MetaPPCP, which takes advantage of the complementarity, of state-of-the-art protein crystallization propensity predictors to provide predictions with about 80% accuracy. Our method combines predictions of XtalPred and CRYSTALP2 with information concerning isoelectric point, hydropathy and number of solved structures for similar sequences. Empirical comparison shows that MetaPPCP outperforms current predictors including OB-Score. XtalPred, ParCrys, and CRYSTALP2. MetaPPCP obtains over 92% accuracy for over a half of its predictions that have probability (propensity to be predicted as crystallizable or crystallization resistant) of above 0.8. The proposed method could provide useful input for target selection procedures of current structural genomics efforts. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:10 / 15
页数:6
相关论文
共 34 条
  • [1] [Anonymous], Data Mining Practical Machine Learning Tools and Techniques with Java
  • [2] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [3] A tour of structural genomics
    Brenner, SE
    [J]. NATURE REVIEWS GENETICS, 2001, 2 (10) : 801 - 809
  • [4] Protein biophysical properties that correlate with crystallization success in Thermotoga maritima:: Maximum clustering strategy for structural genomics
    Canaves, JM
    Page, R
    Wilson, IA
    Stevens, RC
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 344 (04) : 977 - 991
  • [5] Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches
    Chandonia, JM
    Brenner, SE
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (01) : 166 - 179
  • [6] Turning protein crystallisation from an art into a science
    Chayen, NE
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (05) : 577 - 583
  • [7] PFRES: protein fold classification by using evolutionary information and predicted secondary structure
    Chen, Ke
    Kurgan, Lukasz
    [J]. BIOINFORMATICS, 2007, 23 (21) : 2843 - 2850
  • [8] Prediction of protein crystallization using collocation of amino acid pairs
    Chen, Ke
    Kurgan, Lukasz
    Rahbari, Mandana
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2007, 355 (03) : 764 - 769
  • [9] Investigation of Atomic Level Patterns in Protein-Small Ligand Interactions
    Chen, Ke
    Kurgan, Lukasz
    [J]. PLOS ONE, 2009, 4 (02):
  • [10] TargetDB: a target registration database for structural genomics projects
    Chen, L
    Oughtred, R
    Berman, HM
    Westbrook, J
    [J]. BIOINFORMATICS, 2004, 20 (16) : 2860 - 2862