Mining of protein-protein interfacial residues from massive protein sequential and spatial data

Cited by: 5
Authors
Wang, Debby D. [1]
Zhou, Weiqiang [1]
Yan, Hong [1]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
Keywords
Protein-protein interface prediction; 3D alpha shape modeling; Residue sequence profile; Joint mutual information (JMI); Neuro-fuzzy classifiers (NFCs); Neighborhood classifiers (NECs); CART; Extreme learning machines (ELMs); Naive Bayesian classifiers (NBCs); BIG DATA; INTERACTION SITES; DATA-BANK; INFORMATION; PREDICTION; NETWORK;
DOI
10.1016/j.fss.2014.01.017
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Processing big data is a major challenge in bioinformatics. In this paper, we addressed the problem of identifying protein-protein interfacial residues from massive protein structural data. A protein set comprising 154993 residues was analyzed. We applied three-dimensional alpha shape modeling to identify the surface and interfacial residues in this set, and adopted spatially neighboring residue profiles to characterize each residue. These residue profiles, which capture the sequential and spatial information of proteins, translated the original data into a large matrix. After refining this matrix vertically and horizontally, we implemented and compared a series of popular learning procedures, including neuro-fuzzy classifiers (NFCs), CART, neighborhood classifiers (NECs), extreme learning machines (ELMs) and naive Bayesian classifiers (NBCs), to predict the interfacial residues, aiming to investigate the sensitivity of these massive structural data to different learning mechanisms. As a result, ELMs, CART and NFCs performed better in terms of computational cost, while NFCs, NBCs and ELMs provided favorable prediction accuracies. Overall, NFCs, NBCs and ELMs are favorable choices for handling this type of data quickly and accurately. More importantly, the marginal differences between the prediction performances of these methods imply that this type of data is insensitive to the choice of learning mechanism. (C) 2014 Elsevier B.V. All rights reserved.
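To make the idea of a "spatially neighboring residue profile" concrete, the following is a minimal sketch, not the authors' implementation. It assumes each residue is represented by a single 3D coordinate and a one-letter type, and that a profile is the residue's own type followed by the types of its k nearest residues in space. The function name `neighbor_profiles`, the parameter `k`, and the toy coordinates are all hypothetical; the abstract does not specify how the profiles were actually built.

```python
import math

def neighbor_profiles(coords, residues, k=2):
    """Hypothetical helper: for each residue, return its own type followed
    by the types of its k spatially nearest residues (nearest first)."""
    profiles = []
    for i, ci in enumerate(coords):
        # Rank all other residues by Euclidean distance to residue i;
        # ties are broken by index so the result is deterministic.
        others = sorted(
            (j for j in range(len(coords)) if j != i),
            key=lambda j: (math.dist(ci, coords[j]), j),
        )
        profiles.append([residues[i]] + [residues[j] for j in others[:k]])
    return profiles

# Toy input (assumed values): one coordinate per residue, in angstroms.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.5, 0.0, 0.0), (0.0, 4.0, 0.0)]
residues = ["A", "G", "L", "K"]
profiles = neighbor_profiles(coords, residues, k=2)
```

Stacking one such profile per residue gives a matrix with one row per residue, which is the kind of structure the abstract describes refining before classification. In practice each neighbor would contribute numeric features rather than a single letter.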
Pages: 101-116
Page count: 16