Combining Natural Sequence Variation with High Throughput Mutational Data to Reveal Protein Interaction Sites

被引:21
|
作者
Melamed, Daniel [1 ,2 ]
Young, David L. [2 ]
Miller, Christina R. [1 ,2 ]
Fields, Stanley [1 ,2 ,3 ]
机构
[1] Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[3] Univ Washington, Dept Med, Seattle, WA 98195 USA
来源
PLOS GENETICS | 2015年 / 11卷 / 02期
基金
美国国家卫生研究院;
关键词
YEAST POLY(A)-BINDING PROTEIN; MESSENGER-RNA; BINDING-PROTEIN; WEB SERVER; TRANSLATION; PREDICTION; POLY(A); DOMAIN; RECOGNITION; EVOLUTION;
D O I
10.1371/journal.pgen.1004918
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Many protein interactions are conserved among organisms despite changes in the amino acid sequences that comprise their contact sites, a property that has been used to infer the location of these sites from protein homology. In an inter-species complementation experiment, a sequence present in a homologue is substituted into a protein and tested for its ability to support function. Therefore, substitutions that inhibit function can identify interaction sites that changed over evolution. However, most of the sequence differences within a protein family remain unexplored because of the small-scale nature of these complementation approaches. Here we use existing high throughput mutational data on the in vivo function of the RRM2 domain of the Saccharomyces cerevisiae poly(A)-binding protein, Pab1, to analyze its sites of interaction. Of 197 single amino acid differences in 52 Pab1 homologues, 17 reduce the function of Pab1 when substituted into the yeast protein. The majority of these deleterious mutations interfere with the binding of the RRM2 domain to eIF4G1 and eIF4G2, isoforms of a translation initiation factor. A large-scale mutational analysis of the RRM2 domain in a two-hybrid assay for eIF4G1 binding supports these findings and identifies peripheral residues that make a smaller contribution to eIF4G1 binding. Three single amino acid substitutions in yeast Pab1 corresponding to residues from the human orthologue are deleterious and eliminate binding to the yeast eIF4G isoforms. We create a triple mutant that carries these substitutions and other humanizing substitutions that collectively support a switch in binding specificity of RRM2 from the yeast eIF4G1 to its human orthologue. Finally, we map other deleterious substitutions in Pab1 to inter-domain (RRM2-RRM1) or protein-RNA (RRM2-poly(A)) interaction sites. Thus, the combined approach of large-scale mutational data and evolutionary conservation can be used to characterize interaction sites at single amino acid resolution.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 50 条
  • [31] Alignment-Free Ultra-High-Throughput Comparison of Druggable Protein-Ligand Binding Sites
    Weill, Nathanael
    Rognan, Didier
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (01) : 123 - 135
  • [32] ProteoMirExpress: Inferring MicroRNA and Protein-centered Regulatory Networks from High-throughput Proteomic and mRNA Expression Data
    Qin, Jing
    Li, Mulin Jun
    Wang, Panwen
    Wong, Nai Sum
    Wong, Maria P.
    Xia, Zhengyuan
    Tsao, George S. W.
    Zhang, Michael Q.
    Wang, Junwen
    MOLECULAR & CELLULAR PROTEOMICS, 2013, 12 (11) : 3379 - 3387
  • [33] Integration of Human Protein Sequence and Protein-Protein Interaction Data by Graph Autoencoder to Identify Novel Protein-Abnormal Phenotype Associations
    Liu, Yuan
    He, Ruirui
    Qu, Yingjie
    Zhu, Yuan
    Li, Dianke
    Ling, Xinping
    Xia, Simin
    Li, Zhenqiu
    Li, Dong
    CELLS, 2022, 11 (16)
  • [34] Fixed DNA Molecule Arrays for High-Throughput Single DNA-Protein Interaction Studies
    Tutkus, Marijonas
    Rakickas, Tomas
    Kopustas, Aurimas
    Ivanovaite, Sarune
    Venckus, Oskaras
    Navikas, Vytautas
    Zaremba, Mindaugas
    Manakova, Elena
    Valiokas, Ramunas
    LANGMUIR, 2019, 35 (17) : 5921 - 5930
  • [35] Fuzzy Logic as a Strategy for Combining Marker Statistics to Optimize Preselection of High-Density and Sequence Genotype Data
    Ling, Ashley
    Hay, El Hamidi
    Aggrey, Samuel E.
    Rekaya, Romdhane
    GENES, 2022, 13 (11)
  • [36] Combining techniques for screening and evaluating interaction terms on high-dimensional time-to-event data
    Sariyar, Murat
    Hoffmann, Isabell
    Binder, Harald
    BMC BIOINFORMATICS, 2014, 15
  • [37] Physicochemical graph neural network for learning protein-ligand interaction fingerprints from sequence data
    Koh, Huan Yee
    Nguyen, Anh T. N.
    Pan, Shirui
    May, Lauren T.
    Webb, Geoffrey I.
    NATURE MACHINE INTELLIGENCE, 2024, 6 (06) : 673 - +
  • [38] Physicochemical graph neural network for learning protein-ligand interaction fingerprints from sequence data
    Koh, Huan Yee
    Nguyen, Anh T. N.
    Pan, Shirui
    May, Lauren T.
    Webb, Geoffrey I.
    NATURE MACHINE INTELLIGENCE, 2024, : 673 - 687
  • [39] High-affinity RNA targets of the Survival Motor Neuron protein reveal diverse preferences for sequence and structural motifs
    Ottesen, Eric W.
    Singh, Natalia N.
    Luo, Diou
    Singh, Ravindra N.
    NUCLEIC ACIDS RESEARCH, 2018, 46 (20) : 10983 - 11001
  • [40] Improved estimation of stomatal conductance by combining high-throughput plant phenotyping data and weather variables through machine learning
    Zhang, Junxiao
    Thapa, Kantilata
    Bai, Geng
    Ge, Yufeng
    AGRICULTURAL WATER MANAGEMENT, 2025, 309