SHIFTX2: significantly improved protein chemical shift prediction

被引:514
作者
Han, Beomsoo [1 ]
Liu, Yifeng [1 ]
Ginzinger, Simon W. [4 ]
Wishart, David S. [1 ,2 ,3 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[2] Univ Alberta, Dept Biol Sci, Edmonton, AB, Canada
[3] CNR, NINT, Edmonton, AB T6G 2E8, Canada
[4] Salzburg Univ, Dept Mol Biol, Div Bioinformat, Ctr Appl Mol Engn, A-5020 Salzburg, Austria
基金
加拿大自然科学与工程研究理事会; 奥地利科学基金会;
关键词
NMR; Protein; Chemical shift; Machine learning; WEB SERVER; STRUCTURE GENERATION; SECONDARY-STRUCTURE; MAGNETIC-RESONANCE; C-ALPHA; NMR; C-13; N-15; H-1; C-13(ALPHA);
D O I
10.1007/s10858-011-9478-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A new computer program, called SHIFTX2, is described which is capable of rapidly and accurately calculating diamagnetic H-1, C-13 and N-15 chemical shifts from protein coordinate data. Compared to its predecessor (SHIFTX) and to other existing protein chemical shift prediction programs, SHIFTX2 is substantially more accurate (up to 26% better by correlation coefficient with an RMS error that is up to 3.3x smaller) than the next best performing program. It also provides significantly more coverage (up to 10% more), is significantly faster (up to 8.5x) and capable of calculating a wider variety of backbone and side chain chemical shifts (up to 6x) than many other shift predictors. In particular, SHIFTX2 is able to attain correlation coefficients between experimentally observed and predicted backbone chemical shifts of 0.9800 (N-15), 0.9959 (C-13 alpha), 0.9992 (C-13 beta), 0.9676 (C-13'), 0.9714 ((HN)-H-1), 0.9744 (H-1 alpha) and RMS errors of 1.1169, 0.4412, 0.5163, 0.5330, 0.1711, and 0.1231 ppm, respectively. The correlation between SHIFTX2's predicted and observed side chain chemical shifts is 0.9787 (C-13) and 0.9482 (H-1) with RMS errors of 0.9754 and 0.1723 ppm, respectively. SHIFTX2 is able to achieve such a high level of accuracy by using a large, high quality database of training proteins (> 190), by utilizing advanced machine learning techniques, by incorporating many more features (chi(2) and chi(3) angles, solvent accessibility, H-bond geometry, pH, temperature), and by combining sequence-based with structure-based chemical shift prediction techniques. With this substantial improvement in accuracy we believe that SHIFTX2 will open the door to many long-anticipated applications of chemical shift prediction to protein structure determination, refinement and validation. SHIFTX2 is available both as a standalone program and as a web server (http://www.shiftx2.ca).
引用
收藏
页码:43 / 57
页数:15
相关论文
共 49 条
[41]   CS23D: a web server for rapid protein structure generation using NMR chemical shifts and sequence data [J].
Wishart, David S. ;
Arndt, David ;
Berjanskii, Mark ;
Tang, Peter ;
Zhou, Jianjun ;
Lin, Guohui .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W496-W502
[42]   Automated 1H and 13C chemical shift prediction using the BioMagResBank [J].
Wishart, DS ;
Watson, MS ;
Boyko, RF ;
Sykes, BD .
JOURNAL OF BIOMOLECULAR NMR, 1997, 10 (04) :329-336
[43]   H-1, C-13 AND N-15 CHEMICAL-SHIFT REFERENCING IN BIOMOLECULAR NMR [J].
WISHART, DS ;
BIGAM, CG ;
YAO, J ;
ABILDGAARD, F ;
DYSON, HJ ;
OLDFIELD, E ;
MARKLEY, JL ;
SYKES, BD .
JOURNAL OF BIOMOLECULAR NMR, 1995, 6 (02) :135-140
[44]   RELATIONSHIP BETWEEN NUCLEAR-MAGNETIC-RESONANCE CHEMICAL-SHIFT AND PROTEIN SECONDARY STRUCTURE [J].
WISHART, DS ;
SYKES, BD ;
RICHARDS, FM .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 222 (02) :311-333
[45]   H-1, C-13 AND N-15 RANDOM COIL NMR CHEMICAL-SHIFTS OF THE COMMON AMINO-ACIDS .1. INVESTIGATIONS OF NEAREST-NEIGHBOR EFFECTS [J].
WISHART, DS ;
BIGAM, CG ;
HOLM, A ;
HODGES, RS ;
SYKES, BD .
JOURNAL OF BIOMOLECULAR NMR, 1995, 5 (01) :67-81
[46]   Protein chemical shift analysis: a practical guide [J].
Wishart, DS ;
Nip, AM .
BIOCHEMISTRY AND CELL BIOLOGY-BIOCHIMIE ET BIOLOGIE CELLULAIRE, 1998, 76 (2-3) :153-163
[47]   Asparagine and glutamine: Using hydrogen atom contacts in the choice of side-chain amide orientation [J].
Word, JM ;
Lovell, SC ;
Richardson, JS ;
Richardson, DC .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 285 (04) :1735-1747
[48]   Automated prediction of 15N, 13Cα, 13Cβ and 13C′ chemical shifts in proteins using a density functional database [J].
Xu, XP ;
Case, DA .
JOURNAL OF BIOMOLECULAR NMR, 2001, 21 (04) :321-333
[49]   RefDB: A database of uniformly referenced protein chemical shifts [J].
Zhang, HY ;
Neal, S ;
Wishart, DS .
JOURNAL OF BIOMOLECULAR NMR, 2003, 25 (03) :173-195