Mining of protein-protein interfacial residues from massive protein sequential and spatial data

Cited by: 5
Authors
Wang, Debby D. [1]
Zhou, Weiqiang [1]
Yan, Hong [1]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Protein-protein interface prediction; 3D alpha shape modeling; Residue sequence profile; Joint mutual information (JMI); Neuro-fuzzy classifiers (NFCs); Neighborhood classifiers (NECs); CART; Extreme learning machines (ELMs); Naive Bayesian classifiers (NBCs); BIG DATA; INTERACTION SITES; DATA-BANK; INFORMATION; PREDICTION; NETWORK;
DOI
10.1016/j.fss.2014.01.017
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
It is a great challenge to process big data in bioinformatics. In this paper, we address the problem of identifying protein-protein interfacial residues from massive protein structural data. A protein set comprising 154,993 residues was analyzed. We applied three-dimensional alpha shape modeling to search for surface and interfacial residues in this set, and adopted spatially neighboring residue profiles to characterize each residue. These residue profiles, which capture the sequential and spatial information of proteins, translated the original data into a large matrix. After refining this matrix vertically and horizontally, we comparatively implemented a series of popular learning procedures, including neuro-fuzzy classifiers (NFCs), CART, neighborhood classifiers (NECs), extreme learning machines (ELMs) and naive Bayesian classifiers (NBCs), to predict the interfacial residues, aiming to investigate the sensitivity of these massive structural data to different learning mechanisms. In the results, ELMs, CART and NFCs performed better in terms of computational cost, while NFCs, NBCs and ELMs provided favorable prediction accuracies. Overall, NFCs, NBCs and ELMs are favorable choices for handling this type of data quickly and accurately. More importantly, the marginal differences between the prediction performances of these methods imply that this type of data is insensitive to the choice of learning mechanism. (C) 2014 Elsevier B.V. All rights reserved.
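As a rough illustration of one of the learning procedures named in the abstract, the following is a minimal sketch of a naive Bayesian classifier applied to residue profiles. It is not the authors' implementation. The profile encoding (a short string holding the residue type of a surface residue and of its spatial neighbors), the class name `CategoricalNaiveBayes`, the Laplace smoothing constant, and the six training profiles are all assumptions invented for this example.

```python
import math
from collections import Counter, defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residue types


class CategoricalNaiveBayes:
    """Naive Bayes over fixed-length profiles of categorical features."""

    def __init__(self, alphabet=AMINO_ACIDS, alpha=1.0):
        self.alphabet = alphabet
        self.alpha = alpha  # Laplace smoothing constant (assumed value)

    def fit(self, profiles, labels):
        self.class_counts = Counter(labels)
        self.n_positions = len(profiles[0])
        # counts[label][position][residue] -> occurrences in the training set
        self.counts = defaultdict(
            lambda: [Counter() for _ in range(self.n_positions)]
        )
        for profile, label in zip(profiles, labels):
            for pos, residue in enumerate(profile):
                self.counts[label][pos][residue] += 1
        return self

    def log_posteriors(self, profile):
        """Unnormalized log posterior of each class for one profile."""
        total = sum(self.class_counts.values())
        k = len(self.alphabet)
        scores = {}
        for label, n_label in self.class_counts.items():
            score = math.log(n_label / total)  # log prior
            for pos, residue in enumerate(profile):
                num = self.counts[label][pos][residue] + self.alpha
                score += math.log(num / (n_label + self.alpha * k))
            scores[label] = score
        return scores

    def predict(self, profile):
        scores = self.log_posteriors(profile)
        return max(scores, key=scores.get)


# Hypothetical toy data: 1 = interfacial residue, 0 = non-interfacial.
train_profiles = ["LWY", "LFY", "IWF", "KDE", "EDK", "KES"]
train_labels = [1, 1, 1, 0, 0, 0]

model = CategoricalNaiveBayes().fit(train_profiles, train_labels)
print(model.predict("LWF"), model.predict("KDK"))
```

In a setting like the one the abstract describes, each row of the large profile matrix would play the role of one entry in `train_profiles`; the real feature encoding is likely richer than single-letter residue types.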
Pages: 101-116
Page count: 16