Cluster based prediction of PDZ-peptide interactions

被引:14
作者
Kundu, Kousik [1 ]
Backofen, Rolf [1 ,2 ,3 ,4 ]
机构
[1] Univ Freiburg, Dept Comp Sci, Bioinformat Grp, Freiburg, Germany
[2] Univ Freiburg, Ctr Biol Signalling Studies BIOSS, Freiburg, Germany
[3] Univ Freiburg, Ctr Biol Syst Anal ZBSA, Freiburg, Germany
[4] Univ Copenhagen, Ctr Noncoding RNA Technol & Hlth, DK-1870 Frederiksberg C, Denmark
来源
BMC GENOMICS | 2014年 / 15卷
关键词
PROTEIN INTERACTIONS; DOMAIN; RECOGNITION; CLASSIFICATION; FRAMEWORK; DISPLAY;
D O I
10.1186/1471-2164-15-S1-S5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: PDZ domains are one of the most promiscuous protein recognition modules that bind with short linear peptides and play an important role in cellular signaling. Recently, few high-throughput techniques (e.g. protein microarray screen, phage display) have been applied to determine in-vitro binding specificity of PDZ domains. Currently, many computational methods are available to predict PDZ-peptide interactions but they often provide domain specific models and/or have a limited domain coverage. Results: Here, we composed the largest set of PDZ domains derived from human, mouse, fly and worm proteomes and defined binding models for PDZ domain families to improve the domain coverage and prediction specificity. For that purpose, we first identified a novel set of 138 PDZ families, comprising of 548 PDZ domains from aforementioned organisms, based on efficient clustering according to their sequence identity. For 43 PDZ families, covering 226 PDZ domains with available interaction data, we built specialized models using a support vector machine approach. The advantage of family-wise models is that they can also be used to determine the binding specificity of a newly characterized PDZ domain with sufficient sequence identity to the known families. Since most current experimental approaches provide only positive data, we have to cope with the class imbalance problem. Thus, to enrich the negative class, we introduced a powerful semi-supervised technique to generate high confidence non-interaction data. We report competitive predictive performance with respect to state-of-the-art approaches. Conclusions: Our approach has several contributions. First, we show that domain coverage can be increased by applying accurate clustering technique. Second, we developed an approach based on a semi-supervised strategy to get high confidence negative data. Third, we allowed high order correlations between the amino acid positions in the binding peptides. Fourth, our method is general enough and will easily be applicable to other peptide recognition modules such as SH2 domains and finally, we performed a genome-wide prediction for 101 human and 102 mouse PDZ domains and uncovered novel interactions with biological relevance. We make all the predictive models and genome-wide predictions freely available to the scientific community.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [11] Peptide Targeting of PDZ-Dependent Interactions as Pharmacological Intervention in Immune-Related Diseases
    Gutierrez-Gonzalez, Luis H.
    Rivas-Fuentes, Selma
    Guzman-Beltran, Silvia
    Flores-Flores, Angelica
    Rosas-Garcia, Jorge
    Santos-Mendoza, Teresa
    [J]. MOLECULES, 2021, 26 (21):
  • [12] Structure-based prediction of T cell receptor:peptide-MHC interactions
    Bradley, Philip
    [J]. ELIFE, 2023, 12
  • [13] Characterization of PDZ domain-peptide interaction interface based on energetic patterns
    Li, Nan
    Hou, Tingjun
    Ding, Bo
    Wang, Wei
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (11) : 3208 - 3220
  • [14] Proteomic peptide phage display uncovers novel interactions of the PDZ1-2 supramodule of syntenin
    Garrido-Urbani, Sarah
    Garg, Pankaj
    Ghossoub, Rania
    Arnold, Roland
    Lembo, Frederique
    Sundell, Gustav N.
    Kim, Philip M.
    Lopez, Marc
    Zimmermann, Pascale
    Sidhu, Sachdev S.
    Ivarsson, Ylva
    [J]. FEBS LETTERS, 2016, 590 (01) : 3 - 12
  • [15] Interaction prediction and classification of PDZ domains
    Kalyoncu, Sibel
    Keskin, Ozlem
    Gursoy, Attila
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [16] Deciphering peptide-protein interactions via composition-based prediction: a case study with survivin/BIRC5
    Anindya, Atsarina Larasati
    Olsson, Torbjorn Nur
    Jensen, Maja
    Garcia-Bonete, Maria-Jose
    Wheatley, Sally P.
    Bokarewa, Maria, I
    Mezzasalma, Stefano A.
    Katona, Gergely
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (02):
  • [17] A Systematic Family-wide Investigation Reveals that ∼30% of Mammalian PDZ Domains Engage in PDZ-PDZ Interactions
    Chang, Bryan H.
    Gujral, Taranjit S.
    Karp, Ethan S.
    BuKhalid, Raghida
    Grantcharova, Viara P.
    MacBeath, Gavin
    [J]. CHEMISTRY & BIOLOGY, 2011, 18 (09): : 1143 - 1152
  • [18] Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm
    Li, Bi-Qing
    Zhang, Yu-Hang
    Jin, Mei-Ling
    Huang, Tao
    Cai, Yu-Dong
    [J]. CURRENT BIOINFORMATICS, 2018, 13 (01) : 14 - 24
  • [19] A machine learning based method for the prediction of G protein-coupled receptor-binding PDZ domain proteins
    Eo, Hae-Seok
    Kim, Sungmin
    Koo, Hyeyoung
    Kim, Won
    [J]. MOLECULES AND CELLS, 2009, 27 (06) : 629 - 634
  • [20] Characterization of PDZ domain-peptide interactions using an integrated protocol of QM/MM, PB/SA, and CFEA analyses
    Tian, Feifei
    Lv, Yonggang
    Zhou, Peng
    Yang, Li
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2011, 25 (10) : 947 - 958