M-TASSER: An algorithm for protein quaternary structure prediction

被引:49
作者
Chen, Huiling [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Sch Biol, Ctr Study Syst Biol, Atlanta, GA 30332 USA
关键词
D O I
10.1529/biophysj.107.114280
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
In a cell, it has been estimated that each protein on average interacts with roughly 10 others, resulting in tens of thousands of proteins known or suspected to have interaction partners; of these, only a tiny fraction have solved protein structures. To partially address this problem, we have developed M-TASSER, a hierarchical method to predict protein quaternary structure from sequence that involves template identification by multimeric threading, followed by multimer model assembly and refinement. The final models are selected by structure clustering. M-TASSER has been tested on a benchmark set comprising 241 dimers having templates with weak sequence similarity and 246 without multimeric templates in the dimer library. Of the total of 207 targets predicted to interact as dimers, 165 (80%) were correctly assigned as interacting with a true positive rate of 68% and a false positive rate of 17%. The initial best template structures have an average root mean-square deviation to native of 5.3, 6.7, and 7.4 (A) over circle for the monomer, interface, and dimer structures. The final model shows on average a root mean-square deviation improvement of 1.3, 1.3, and 1.5 (A) over circle over the initial template structure for the monomer, interface, and dimer structures, with refinement evident for 87% of the cases. Thus, we have developed a promising approach to predict full-length quaternary structure for proteins that have weak sequence similarity to proteins of solved quaternary structure.
引用
收藏
页码:918 / 928
页数:11
相关论文
共 51 条
  • [1] Proteomics: The society of proteins
    Abbott, A
    [J]. NATURE, 2002, 417 (6892) : 894 - 896
  • [2] Protein complexes:: structure prediction challenges for the 21st century
    Aloy, P
    Pichaud, M
    Russell, RB
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (01) : 15 - 22
  • [3] Ten thousand interactions for the molecular biologist
    Aloy, P
    Russell, RB
    [J]. NATURE BIOTECHNOLOGY, 2004, 22 (10) : 1317 - 1321
  • [4] The relationship between sequence and interaction divergence in proteins
    Aloy, P
    Ceulemans, H
    Stark, A
    Russell, RB
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (05) : 989 - 998
  • [5] InterPreTS: protein Interaction Prediction through Tertiary Structure
    Aloy, P
    Russell, RB
    [J]. BIOINFORMATICS, 2003, 19 (01) : 161 - 162
  • [6] Interrogating protein interaction networks through structural biology
    Aloy, P
    Russell, RB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) : 5896 - 5901
  • [7] ALTSCHUL SF, 1997, NUCLEIC ACIDS RES, V25, P3402
  • [8] Domain combinations in archaeal, eubacterial and eukaryotic proteomes
    Apic, G
    Gough, J
    Teichmann, SA
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) : 311 - 325
  • [9] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [10] The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    Boeckmann, B
    Bairoch, A
    Apweiler, R
    Blatter, MC
    Estreicher, A
    Gasteiger, E
    Martin, MJ
    Michoud, K
    O'Donovan, C
    Phan, I
    Pilbout, S
    Schneider, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 365 - 370