AF2Complex predicts direct physical interactions in multimeric proteins with deep learning

被引:156
作者
Gao, Mu [1 ]
An, Davi Nakajima [2 ]
Parks, Jerry M. [3 ]
Skolnick, Jeffrey [1 ]
机构
[1] Sch Biol Sci, Ctr Study Syst Biol, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Sch Comp Sci, Atlanta, GA 30332 USA
[3] Oak Ridge Natl Lab, Biosci Div, Oak Ridge, TN USA
关键词
HEME TRAFFICKING; DOCKING; COMPLEX; RESOURCE; RESIDUE;
D O I
10.1038/s41467-022-29394-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate descriptions of protein-protein interactions are essential for understanding biological systems. Here the authors present AF2Complex and show that application to the E. coli cytochrome biogenesis system I yields confident computational models for three sought-after assemblies. Accurate descriptions of protein-protein interactions are essential for understanding biological systems. Remarkably accurate atomic structures have been recently computed for individual proteins by AlphaFold2 (AF2). Here, we demonstrate that the same neural network models from AF2 developed for single protein sequences can be adapted to predict the structures of multimeric protein complexes without retraining. In contrast to common approaches, our method, AF2Complex, does not require paired multiple sequence alignments. It achieves higher accuracy than some complex protein-protein docking strategies and provides a significant improvement over AF-Multimer, a development of AlphaFold for multimeric proteins. Moreover, we introduce metrics for predicting direct protein-protein interactions between arbitrary protein pairs and validate AF2Complex on some challenging benchmark sets and the E. coli proteome. Lastly, using the cytochrome c biogenesis system I as an example, we present high-confidence models of three sought-after assemblies formed by eight members of this system.
引用
收藏
页数:13
相关论文
共 66 条
[1]   Structure-based assembly of protein complexes in yeast [J].
Aloy, P ;
Böttcher, B ;
Ceulemans, H ;
Leutwein, C ;
Mellwig, C ;
Fischer, S ;
Gavin, AC ;
Bork, P ;
Superti-Furga, G ;
Serrano, L ;
Russell, RB .
SCIENCE, 2004, 303 (5666) :2026-2029
[2]   Large-scale identification of protein-protein interaction of Escherichia coli K-12 [J].
Arifuzzaman, M ;
Maeda, M ;
Itoh, A ;
Nishikata, K ;
Takita, C ;
Saito, R ;
Ara, T ;
Nakahigashi, K ;
Huang, HC ;
Hirai, A ;
Tsuzuki, K ;
Nakamura, S ;
Altaf-Ul-Amin, M ;
Oshima, T ;
Baba, T ;
Yamamoto, N ;
Kawamura, T ;
Ioka-Nakamichi, T ;
Kitagawa, M ;
Tomita, M ;
Kanaya, S ;
Wada, C ;
Mori, H .
GENOME RESEARCH, 2006, 16 (05) :686-691
[3]   Accurate prediction of protein structures and interactions using a three-track neural network [J].
Baek, Minkyung ;
DiMaio, Frank ;
Anishchenko, Ivan ;
Dauparas, Justas ;
Ovchinnikov, Sergey ;
Lee, Gyu Rie ;
Wang, Jue ;
Cong, Qian ;
Kinch, Lisa N. ;
Schaeffer, R. Dustin ;
Millan, Claudia ;
Park, Hahnbeom ;
Adams, Carson ;
Glassman, Caleb R. ;
DeGiovanni, Andy ;
Pereira, Jose H. ;
Rodrigues, Andria V. ;
van Dijk, Alberdina A. ;
Ebrecht, Ana C. ;
Opperman, Diederik J. ;
Sagmeister, Theo ;
Buhlheller, Christoph ;
Pavkov-Keller, Tea ;
Rathinaswamy, Manoj K. ;
Dalwadi, Udit ;
Yip, Calvin K. ;
Burke, John E. ;
Garcia, K. Christopher ;
Grishin, Nick V. ;
Adams, Paul D. ;
Read, Randy J. ;
Baker, David .
SCIENCE, 2021, 373 (6557) :871-+
[4]   DockQ: A Quality Measure for Protein-Protein Docking Models [J].
Basu, Sankar ;
Wallner, Bjorn .
PLOS ONE, 2016, 11 (08)
[5]   UniProt: a worldwide hub of protein knowledge [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Alpi, Emanuele ;
Bely, Benoit ;
Bingley, Mark ;
Britto, Ramona ;
Bursteinas, Borisas ;
Busiello, Gianluca ;
Bye-A-Jee, Hema ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Daniel ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Ignatchenko, Alexandr ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Nightingale, Andrew ;
Onwubiko, Joseph ;
Palka, Barbara ;
Pichler, Klemens ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Renaux, Alexandre ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Vasudev, Preethi ;
Volynkin, Vladimir ;
Wardell, Tony .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D506-D515
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]   Architecture of the membrane-bound cytochrome c heme lyase CcmF [J].
Brausemann, Anton ;
Zhang, Lin ;
Ilcu, Lorena ;
Einsle, Oliver .
NATURE CHEMICAL BIOLOGY, 2021, 17 (07) :800-805
[8]  
Bryant P., 2021, IMPROVED PREDICTION
[9]   Interaction network containing conserved and essential protein complexes in Escherichia coli [J].
Butland, G ;
Peregrín-Alvarez, JM ;
Li, J ;
Yang, WH ;
Yang, XC ;
Canadien, V ;
Starostine, A ;
Richards, D ;
Beattie, B ;
Krogan, N ;
Davey, M ;
Parkinson, J ;
Greenblatt, J ;
Emili, A .
NATURE, 2005, 433 (7025) :531-537
[10]   M-TASSER: An algorithm for protein quaternary structure prediction [J].
Chen, Huiling ;
Skolnick, Jeffrey .
BIOPHYSICAL JOURNAL, 2008, 94 (03) :918-928