Mining of protein-protein interfacial residues from massive protein sequential and spatial data

Citations: 5
Authors
Wang, Debby D. [1]
Zhou, Weiqiang [1]
Yan, Hong [1]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Protein-protein interface prediction; 3D alpha shape modeling; Residue sequence profile; Joint mutual information (JMI); Neuro-fuzzy classifiers (NFCs); Neighborhood classifiers (NECs); CART; Extreme learning machines (ELMs); Naive Bayesian classifiers (NBCs); BIG DATA; INTERACTION SITES; DATA-BANK; INFORMATION; PREDICTION; NETWORK;
DOI
10.1016/j.fss.2014.01.017
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Processing big data is a major challenge in bioinformatics. In this paper, we address the problem of identifying protein-protein interfacial residues from massive protein structural data. A protein set comprising 154,993 residues was analyzed. We applied three-dimensional alpha shape modeling to search for surface and interfacial residues in this set, and adopted spatially neighboring residue profiles to characterize each residue. These residue profiles, which capture the sequential and spatial information of proteins, translate the original data into a large matrix. After refining this matrix vertically and horizontally, we comparatively implemented a series of popular learning procedures, including neuro-fuzzy classifiers (NFCs), CART, neighborhood classifiers (NECs), extreme learning machines (ELMs) and naive Bayesian classifiers (NBCs), to predict the interfacial residues, aiming to investigate the sensitivity of these massive structural data to different learning mechanisms. As a result, ELMs, CART and NFCs performed better in terms of computational cost, while NFCs, NBCs and ELMs provided favorable prediction accuracies. Overall, NFCs, NBCs and ELMs are favorable choices for handling this type of data quickly and accurately. More importantly, the marginal differences between the prediction performances of these methods imply that this type of data is insensitive to the choice of learning mechanism. (C) 2014 Elsevier B.V. All rights reserved.
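The keywords list joint mutual information (JMI), and the abstract mentions refining the residue-profile matrix "horizontally". Assuming that horizontal refinement means selecting feature columns, a greedy JMI ranking could be sketched as below. This is an illustrative sketch only, not the authors' implementation: the function names (`mutual_information`, `jmi_select`) and the toy data are hypothetical, and real residue profiles would first need to be discretized.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Estimate I(X;Y) in bits from two equal-length sequences of discrete values."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def jmi_select(columns, labels, k):
    """Greedy JMI feature ranking (hypothetical helper).

    The first pick maximizes I(X_i; Y). Each later pick maximizes the sum,
    over already selected features X_j, of I((X_i, X_j); Y).
    Returns the indices of the selected columns in selection order.
    """
    remaining = list(range(len(columns)))
    first = max(remaining, key=lambda i: mutual_information(columns[i], labels))
    selected = [first]
    remaining.remove(first)

    while remaining and len(selected) < k:
        def score(i):
            # Treat the pair (X_i, X_j) as one joint discrete variable.
            return sum(
                mutual_information(list(zip(columns[i], columns[j])), labels)
                for j in selected
            )

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (made up): labels mark interfacial (1) vs. non-interfacial (0) residues.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
f0 = [0, 0, 0, 1, 1, 1, 1, 0]  # agrees with the label on 6 of 8 samples
f1 = list(f0)                  # exact duplicate of f0 (redundant)
f2 = [0, 0, 0, 1, 0, 0, 0, 1]  # uninformative alone, but labels == f0 XOR f2
columns = [f0, f1, f2]

print(jmi_select(columns, labels, 2))  # [0, 2]
```

In this toy case the duplicate column `f1` is skipped in favor of `f2`, because the pair (`f0`, `f2`) jointly determines the label even though `f2` carries no information on its own. That complementarity is what a joint criterion rewards and a purely marginal ranking would miss.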
Pages: 101-116
Page count: 16
Related papers
50 in total
  • [41] Specificity and stability of transient protein-protein interactions
    Vishwanath, Sneha
    Sukhwal, Anshul
    Sowdhamini, Ramanathan
    Srinivasan, Narayanaswamy
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 44 : 77 - 86
  • [42] Prediction of protein-protein interactions from primary sequences
    Dong, Qiwen
    Zhou, Shuigeng
    Liu, Xuan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (02) : 211 - 227
  • [43] Protein-protein interaction networks: from interactions to networks
    Cho, SY
    Park, SG
    Lee, DH
    Park, BC
    JOURNAL OF BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2004, 37 (01) : 45 - 52
  • [44] PPInS: a repository of protein-protein interaction sitesbase
    Kumar, Vicky
    Mahato, Suchismita
    Munshi, Anjana
    Kulharia, Mahesh
    SCIENTIFIC REPORTS, 2018, 8
  • [45] Alternative Protein-Protein Interfaces Are Frequent Exceptions
    Hamp, Tobias
    Rost, Burkhard
    PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (08)
  • [46] Protein-protein binding supersites
    Viswanathan, Raji
    Fajardo, Eduardo
    Steinberg, Gabriel
    Haller, Matthew
    Fiser, Andras
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (01)
  • [47] Template-based protein-protein docking exploiting pairwise interfacial residue restraints
    Xue, Li C.
    Rodrigues, Joao P. G. L. M.
    Dobbs, Drena
    Honavar, Vasant
    Bonvin, Alexandre M. J. J.
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (03) : 458 - 466
  • [48] Predictive Models and Impact of Interfacial Contacts and Amino Acids on Protein-Protein Binding Affinity
    Yi, Carey Huang
    Taylor, Mitchell Lee
    Ziebarth, Jesse
    Wang, Yongmei
    ACS OMEGA, 2024, 9 (03) : 3454 - 3468
  • [49] Extracting Coevolutionary Features from Protein Sequences for Predicting Protein-Protein Interactions
    Hu, Lun
    Chan, Keith C. C.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) : 155 - 166
  • [50] Computational design, construction, and characterization of a set of specificity determining residues in protein-protein interactions
    Nagao, Chioko
    Izako, Nozomi
    Soga, Shinji
    Khan, Samia Haseeb
    Kawabata, Shigeki
    Shirai, Hiroki
    Mizuguchi, Kenji
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2012, 80 (10) : 2426 - 2436