Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences

被引:69
作者
Green, Anna G. [1 ]
Elhabashy, Hadeer [2 ,3 ,4 ]
Brock, Kelly P. [1 ]
Maddamsetti, Rohan [1 ]
Kohlbacher, Oliver [2 ,3 ,4 ,5 ,6 ]
Marks, Debora S. [3 ,7 ]
机构
[1] Harvard Med Sch, Dept Syst Biol, Boston, MA 02115 USA
[2] Max Planck Inst Dev Biol, Biomol Interact, D-72076 Tubingen, Germany
[3] Univ Tubingen, Inst Bioinformat & Med Informat, Sand 14, D-72076 Tubingen, Germany
[4] Univ Tubingen, Dept Comp Sci, WSI ZBIT, Sand 14, D-72076 Tubingen, Germany
[5] Univ Tubingen, Quantitat Biol Ctr, Morgenstelle 8, D-72076 Tubingen, Germany
[6] Univ Tubingen Hosp, Inst Translat Bioinformat, Sand 14, D-72076 Tubingen, Germany
[7] Broad Inst Harvard & MIT, Cambridge, MA 02142 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
INTERACTION NETWORKS; CELL-ENVELOPE; COMPLEXES; IDENTIFICATION; LANDSCAPE; TAXONOMY;
D O I
10.1038/s41467-021-21636-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Increasing numbers of protein interactions have been identified in high-throughput experiments, but only a small proportion have solved structures. Recently, sequence coevolution-based approaches have led to a breakthrough in predicting monomer protein structures and protein interaction interfaces. Here, we address the challenges of large-scale interaction prediction at residue resolution with a fast alignment concatenation method and a probabilistic score for the interaction of residues. Importantly, this method (EVcomplex2) is able to assess the likelihood of a protein interaction, as we show here applied to large-scale experimental datasets where the pairwise interactions are unknown. We predict 504 interactions de novo in the E. coli membrane proteome, including 243 that are newly discovered. While EVcomplex2 does not require available structures, coevolving residue pairs can be used to produce structural models of protein interactions, as done here for membrane complexes including the Flagellar Hook-Filament Junction and the Tol/Pal complex. Our understanding of the residue-level details of protein interactions remains incomplete. Here, the authors show sequence coevolution can be used to infer interacting proteins with residue-level details, including predicting 467 interactions de novo in the Escherichia coli cell envelope proteome.
引用
收藏
页数:12
相关论文
共 56 条
[1]  
[Anonymous], 2006, The protein data bank, DOI DOI 10.1107/97809553602060000722
[2]   Global landscape of cell envelope protein complexes in Escherichia coli [J].
Babu, Mohan ;
Bundalovic-Torma, Cedoljub ;
Calmettes, Charles ;
Phanse, Sadhna ;
Zhang, Qingzhou ;
Jiang, Yue ;
Minic, Zoran ;
Kim, Sunyoung ;
Mehla, Jitender ;
Gagarinova, Alla ;
Rodionova, Irina ;
Kumar, Ashwani ;
Guo, Hongbo ;
Kagan, Olga ;
Pogoutse, Oxana ;
Aoki, Hiroyuki ;
Deineko, Viktor ;
Caufield, J. Harry ;
Holtzapple, Erik ;
Zhang, Zhongge ;
Vastermark, Ake ;
Pandya, Yogee ;
Lai, Christine Chieh-lin ;
El Bakkouri, Majida ;
Hooda, Yogesh ;
Shah, Megha ;
Burnside, Dan ;
Hooshyar, Mohsen ;
Vlasblom, James ;
Rajagopala, Sessandra V. ;
Golshani, Ashkan ;
Wuchty, Stefan ;
Greenblatt, Jack F. ;
Saier, Milton ;
Uetz, Peter ;
Moraes, Trevor F. ;
Parkinson, John ;
Emili, Andrew .
NATURE BIOTECHNOLOGY, 2018, 36 (01) :103-+
[3]   Learning generative models for protein fold families [J].
Balakrishnan, Sivaraman ;
Kamisetty, Hetunandan ;
Carbonell, Jaime G. ;
Lee, Su-In ;
Langmead, Christopher James .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (04) :1061-1078
[4]   Inferring interaction partners from protein sequences using mutual information [J].
Bitbol, Anne-Florence .
PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (11)
[5]   Inferring interaction partners from protein sequences [J].
Bitbol, Anne-Florence ;
Dwyer, Robert S. ;
Colwell, Lucy J. ;
Wingreen, Ned S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (43) :12180-12185
[6]   Allosteric β-propeller signalling in TolB and its manipulation by translocating colicins [J].
Bonsor, Daniel A. ;
Hecht, Oliver ;
Vankemmelbeke, Mireille ;
Sharma, Amit ;
Krachler, Anne Marie ;
Housden, Nicholas G. ;
Lilly, Katie J. ;
James, Richard ;
Moore, Geoffrey R. ;
Kleanthous, Colin .
EMBO JOURNAL, 2009, 28 (18) :2846-2857
[7]   Version 1.2 of the Crystallography and NMR system [J].
Brunger, Axel T. .
NATURE PROTOCOLS, 2007, 2 (11) :2728-2733
[8]   Structure of FlgK reveals the divergence of the bacterial Hook-Filament Junction of Campylobacter [J].
Bulieris, Paula V. ;
Shaikh, Nausad H. ;
Freddolino, Peter L. ;
Samatey, Fadel A. .
SCIENTIFIC REPORTS, 2017, 7
[9]   Interaction network containing conserved and essential protein complexes in Escherichia coli [J].
Butland, G ;
Peregrín-Alvarez, JM ;
Li, J ;
Yang, WH ;
Yang, XC ;
Canadien, V ;
Starostine, A ;
Richards, D ;
Beattie, B ;
Krogan, N ;
Davey, M ;
Parkinson, J ;
Greenblatt, J ;
Emili, A .
NATURE, 2005, 433 (7025) :531-537
[10]   NATURE OF ACCESSIBLE AND BURIED SURFACES IN PROTEINS [J].
CHOTHIA, C .
JOURNAL OF MOLECULAR BIOLOGY, 1976, 105 (01) :1-14