Critical assessment of methods of protein structure prediction (CASP)-Round XV

被引:32
作者
Kryshtafovych, Andriy [1 ]
Schwede, Torsten [2 ,3 ]
Topf, Maya [4 ,5 ]
Fidelis, Krzysztof [1 ]
Moult, John [6 ,7 ]
机构
[1] Univ Calif Davis, Genome Ctr, Davis, CA USA
[2] Univ Basel, Biozentrum, Basel, Switzerland
[3] Univ Basel, SIB Swiss Inst Bioinformat, Basel, Switzerland
[4] Leibniz Inst Virol, Ctr Struct Syst Biol, Hamburg, Germany
[5] Univ Klinikum Hamburg Eppendorf UKE, Hamburg, Germany
[6] Inst Biosci & Biotechnol Res, Rockville, MD 20850 USA
[7] Univ Maryland, Dept Cell Biol & Mol Genet, College Pk, MD USA
关键词
CASP; community wide experiment; protein structure prediction;
D O I
10.1002/prot.26617
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Computing protein structure from amino acid sequence information has been a long-standing grand challenge. Critical assessment of structure prediction (CASP) conducts community experiments aimed at advancing solutions to this and related problems. Experiments are conducted every 2 years. The 2020 experiment (CASP14) saw major progress, with the second generation of deep learning methods delivering accuracy comparable with experiment for many single proteins. There is an expectation that these methods will have much wider application in computational structural biology. Here we summarize results from the most recent experiment, CASP15, in 2022, with an emphasis on new deep learning-driven progress. Other papers in this special issue of proteins provide more detailed analysis. For single protein structures, the AlphaFold2 deep learning method is still superior to other approaches, but there are two points of note. First, although AlphaFold2 was the core of all the most successful methods, there was a wide variety of implementation and combination with other methods. Second, using the standard AlphaFold2 protocol and default parameters only produces the highest quality result for about two thirds of the targets, and more extensive sampling is required for the others. The major advance in this CASP is the enormous increase in the accuracy of computed protein complexes, achieved by the use of deep learning methods, although overall these do not fully match the performance for single proteins. Here too, AlphaFold2 based method perform best, and again more extensive sampling than the defaults is often required. Also of note are the encouraging early results on the use of deep learning to compute ensembles of macromolecular structures. Critically for the usability of computed structures, for both single proteins and protein complexes, deep learning derived estimates of both local and global accuracy are of high quality, however the estimates in interface regions are slightly less reliable. CASP15 also included computation of RNA structures for the first time. Here, the classical approaches produced better agreement with experiment than the new deep learning ones, and accuracy is limited. Also, for the first time, CASP included the computation of protein-ligand complexes, an area of special interest for drug design. Here too, classical methods were still superior to deep learning ones. Many new approaches were discussed at the CASP conference, and it is clear methods will continue to advance.
引用
收藏
页码:1539 / 1549
页数:11
相关论文
共 39 条
  • [1] Protein target highlights in CASP15: Analysis of models by structure providers
    Alexander, Leila T.
    Durairaj, Janani
    Kryshtafovych, Andriy
    Abriata, Luciano A.
    Bayo, Yusupha
    Bhabha, Gira
    Breyton, Cecile
    Caulton, Simon G.
    Chen, James
    Degroux, Seraphine
    Ekiert, Damian C.
    Erlandsen, Benedikte S.
    Freddolino, Peter L.
    Gilzer, Dominic
    Greening, Chris
    Grimes, Jonathan M.
    Grinter, Rhys
    Gurusaran, Manickam
    Hartmann, Marcus D.
    Hitchman, Charlie J.
    Keown, Jeremy R.
    Kropp, Ashleigh
    Kursula, Petri
    Lovering, Andrew L.
    Lemaitre, Bruno
    Lia, Andrea
    Liu, Shiheng
    Logotheti, Maria
    Lu, Shuze
    Markusson, Sigurbjorn
    Miller, Mitchell D.
    Minasov, George
    Niemann, Hartmut H.
    Opazo, Felipe
    Phillips Jr, George N. N.
    Davies, Owen R.
    Rommelaere, Samuel
    Rosas-Lemus, Monica
    Roversi, Pietro
    Satchell, Karla
    Smith, Nathan
    Wilson, Mark A.
    Wu, Kuan-Lin
    Xia, Xian
    Xiao, Han
    Zhang, Wenhua
    Zhou, Z. Hong
    Fidelis, Krzysztof
    Topf, Maya
    Moult, John
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2023, 91 (12) : 1571 - 1599
  • [2] PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS
    ANFINSEN, CB
    [J]. SCIENCE, 1973, 181 (4096) : 223 - 230
  • [3] Baek M., 2023, BIORXIV, DOI [10.1101/2023.05.24.542179, DOI 10.1101/2023.05.24.542179]
  • [4] Accurate prediction of protein structures and interactions using a three-track neural network
    Baek, Minkyung
    DiMaio, Frank
    Anishchenko, Ivan
    Dauparas, Justas
    Ovchinnikov, Sergey
    Lee, Gyu Rie
    Wang, Jue
    Cong, Qian
    Kinch, Lisa N.
    Schaeffer, R. Dustin
    Millan, Claudia
    Park, Hahnbeom
    Adams, Carson
    Glassman, Caleb R.
    DeGiovanni, Andy
    Pereira, Jose H.
    Rodrigues, Andria V.
    van Dijk, Alberdina A.
    Ebrecht, Ana C.
    Opperman, Diederik J.
    Sagmeister, Theo
    Buhlheller, Christoph
    Pavkov-Keller, Tea
    Rathinaswamy, Manoj K.
    Dalwadi, Udit
    Yip, Calvin K.
    Burke, John E.
    Garcia, K. Christopher
    Grishin, Nick V.
    Adams, Paul D.
    Read, Randy J.
    Baker, David
    [J]. SCIENCE, 2021, 373 (6557) : 871 - +
  • [5] Machine-learning methods for ligand-protein molecular docking
    Crampon, Kevin
    Giorkallos, Alexis
    Deldossi, Myrtille
    Baud, Stephanie
    Steffenel, Luiz Angelo
    [J]. DRUG DISCOVERY TODAY, 2022, 27 (01) : 151 - 164
  • [6] Assessment of three-dimensional RNA structure prediction in CASP15
    Das, Rhiju
    Kretsch, Rachael C.
    Simpkin, Adam J.
    Mulvaney, Thomas
    Pham, Phillip
    Rangan, Ramya
    Bu, Fan
    Keegan, Ronan M.
    Topf, Maya
    Rigden, Daniel J.
    Miao, Zhichao
    Westhof, Eric
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2023, 91 (12) : 1747 - 1770
  • [7] Estimation of model accuracy in CASP15 using the ModFOLDdock server
    Edmunds, Nicholas S.
    Alharbi, Shuaa M. A.
    Genc, Ahmet G.
    Adiyaman, Recep
    McGuffin, Liam J.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2023, 91 (12) : 1871 - 1878
  • [8] Evans R, 2022, bioRxiv, DOI DOI 10.1101/2021.10.04.463034
  • [9] Applying and improving AlphaFold at CASP14
    Jumper, John
    Evans, Richard
    Pritzel, Alexander
    Green, Tim
    Figurnov, Michael
    Ronneberger, Olaf
    Tunyasuvunakool, Kathryn
    Bates, Russ
    Zidek, Augustin
    Potapenko, Anna
    Bridgland, Alex
    Meyer, Clemens
    Kohl, Simon A. A.
    Ballard, Andrew J.
    Cowie, Andrew
    Romera-Paredes, Bernardino
    Nikolov, Stanislav
    Jain, Rishub
    Adler, Jonas
    Back, Trevor
    Petersen, Stig
    Reiman, David
    Clancy, Ellen
    Zielinski, Michal
    Steinegger, Martin
    Pacholska, Michalina
    Berghammer, Tamas
    Silver, David
    Vinyals, Oriol
    Senior, Andrew W.
    Kavukcuoglu, Koray
    Kohli, Pushmeet
    Hassabis, Demis
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1711 - 1721
  • [10] Highly accurate protein structure prediction with AlphaFold
    Jumper, John
    Evans, Richard
    Pritzel, Alexander
    Green, Tim
    Figurnov, Michael
    Ronneberger, Olaf
    Tunyasuvunakool, Kathryn
    Bates, Russ
    Zidek, Augustin
    Potapenko, Anna
    Bridgland, Alex
    Meyer, Clemens
    Kohl, Simon A. A.
    Ballard, Andrew J.
    Cowie, Andrew
    Romera-Paredes, Bernardino
    Nikolov, Stanislav
    Jain, Rishub
    Adler, Jonas
    Back, Trevor
    Petersen, Stig
    Reiman, David
    Clancy, Ellen
    Zielinski, Michal
    Steinegger, Martin
    Pacholska, Michalina
    Berghammer, Tamas
    Bodenstein, Sebastian
    Silver, David
    Vinyals, Oriol
    Senior, Andrew W.
    Kavukcuoglu, Koray
    Kohli, Pushmeet
    Hassabis, Demis
    [J]. NATURE, 2021, 596 (7873) : 583 - +