Protein structure prediction by AlphaFold2: are attention and symmetries all you need?

被引:33
作者
Bouatta, Nazim [1 ]
Sorger, Peter [1 ]
AlQuraishi, Mohammed [2 ]
机构
[1] Harvard Med Sch, Lab Syst Pharmacol, Boston, MA 02115 USA
[2] Columbia Univ, Dept Syst Biol, New York, NY 10032 USA
来源
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY | 2021年 / 77卷
关键词
AlphaFold2; protein structure prediction; CASP14; MOLECULAR-DYNAMICS SIMULATIONS; PATHWAYS;
D O I
10.1107/S2059798321007531
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The functions of most proteins result from their 3D structures, but determining their structures experimentally remains a challenge, despite steady advances in crystallography, NMR and single-particle cryoEM. Computationally predicting the structure of a protein from its primary sequence has long been a grand challenge in bioinformatics, intimately connected with understanding protein chemistry and dynamics. Recent advances in deep learning, combined with the availability of genomic data for inferring co-evolutionary patterns, provide a new approach to protein structure prediction that is complementary to longstanding physics-based approaches. The outstanding performance of AlphaFold2 in the recent Critical Assessment of protein Structure Prediction (CASP14) experiment demonstrates the remarkable power of deep learning in structure prediction. In this perspective, we focus on the key features of AlphaFold2, including its use of (i) attention mechanisms and Transformers to capture long-range dependencies, (ii) symmetry principles to facilitate reasoning over protein structures in three dimensions and (iii) end-to-end differentiability as a unifying framework for learning from protein data. The rules of protein folding are ultimately encoded in the physical principles that underpin it; to conclude, the implications of having a powerful computational model for structure prediction that does not explicitly rely on those principles are discussed.
引用
收藏
页码:982 / 991
页数:10
相关论文
共 68 条
[1]   Unified rational protein engineering with sequence-based deep representation learning [J].
Alley, Ethan C. ;
Khimulya, Grigory ;
Biswas, Surojit ;
AlQuraishi, Mohammed ;
Church, George M. .
NATURE METHODS, 2019, 16 (12) :1315-+
[2]   AlphaFold at CASP13 [J].
AlQuraishi, Mohammed .
BIOINFORMATICS, 2019, 35 (22) :4862-4865
[3]   End-to-End Differentiable Learning of Protein Structure [J].
AlQuraishi, Mohammed .
CELL SYSTEMS, 2019, 8 (04) :292-+
[4]   KINETICS OF FORMATION OF NATIVE RIBONUCLEASE DURING OXIDATION OF REDUCED POLYPEPTIDE CHAIN [J].
ANFINSEN, CB ;
HABER, E ;
SELA, M ;
WHITE, FH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1961, 47 (09) :1309-+
[5]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[6]  
[Anonymous], 2020, ALPHAFOLD SOL 50 YEA
[7]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[8]   From Einstein, Bohr, Schrodinger to Bell and Feynman: A New Quantum Revolution? [J].
Aspect, Alain .
NIELS BOHR, 1913-2013, 2016, 68 :147-175
[9]  
Bell J.S., 2004, SPEAKABLE UNSPEAKABL, P14
[10]   PROTEIN MODELING Protein storytelling through physics [J].
Brini, Emiliano ;
Simmerling, Carlos ;
Dill, Ken .
SCIENCE, 2020, 370 (6520) :1056-+