Alternative approach to protein structure prediction based on sequential similarity of physical properties

Cited by: 13
Authors
He, Yi [1]
Rackovsky, S. [1, 2]
Yin, Yanping [1]
Scheraga, Harold A. [1]
Affiliations
[1] Cornell Univ, Dept Chem & Chem Biol, Ithaca, NY 14853 USA
[2] Icahn Sch Med Mt Sinai, Dept Pharmacol & Syst Therapeut, New York, NY 10029 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
homology modeling; amino acid physical properties; protein structure prediction; HIDDEN MARKOV-MODELS; ALPHA-LACTALBUMIN; AMINO-ACIDS; PSI-BLAST; SEQUENCES; INFORMATION; ALIGNMENT; LYSOZYME;
DOI
10.1073/pnas.1504806112
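Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract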
The relationship between protein sequence and structure arises entirely from amino acid physical properties. An alternative method is therefore proposed to identify homologs in which residue equivalence is based exclusively on the pairwise physical property similarities of sequences. This approach, the property factor method (PFM), is entirely different from those in current use. A comparison is made between our method and PSI-BLAST. We demonstrate that traditionally defined sequence similarity can be very low for pairs of sequences (which therefore cannot be identified using PSI-BLAST), but similarity of physical property distributions results in almost identical 3D structures. The performance of PFM is shown to be better than that of PSI-BLAST when sequence matching is comparable, based on a comparison using targets from CASP10 (89 targets) and CASP11 (51 targets). It is also shown that PFM outperforms PSI-BLAST in informatically challenging targets.
Pages: 5029-5032
Page count: 4