Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network

被引:9
作者
Ghulam, Ali [1 ]
Lei, Xiujuan [1 ]
Zhang, Yuchen [1 ]
Cheng, Shi [1 ]
Guo, Min [1 ]
机构
[1] Shaanxi Normal Univ, Sch Comp Sci, Xian 710119, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Proteins; Cancer; Feature extraction; Machine learning; Optimization; Two dimensional displays; Molecular structure prediction; deep learning; convolutional neural network; evolutionary knowledge; multiple features; features extraction; AMINO-ACID-COMPOSITION; SIGNAL-TRANSDUCTION; PREDICTION; CLATHRIN; GPS2;
D O I
10.1109/ACCESS.2020.3027887
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have caused the loss of functional PSPD. Therefore, the creation of an accurate method to predict its roles is a critical step toward human disease and pathways. In this study, we proposed a deep learning model based on a two-dimensional neural network (2D-CNN-PSPD) with a pathway-specific protein domain association prediction. In terms of the purposes of a sub-pathway, its parent pathway and its super pathway are linked to the Uni-Pathway. We also proposed a dipeptide composition (DPC) model and a dipeptide deviation (DDE) model of feature extraction profiles as PSSM. Then, we predicted the proteins associated with the same sub-pathway or with the same organism. The DDE model and DPC model of the PSSM feature profile input was associated with our proposed 2D-CNN method. We deployed several parameters to optimize the model's output performance and used the hyperparameter optimization approach to find the best model for our dataset based on the 10-fold cross-validation results. Ultimately, we assessed the predictive performance of the current model by using independent datasets and cross-validation datasets. Therefore, we enhanced the efficiency of deep learning methods. PSPD is involved in any known pathway and then follow the association in different stages of the pathway hierarchy with other proteins. Our proposed method could identify 2D-CNN-PSPD with 0.83% sensitivity, 0.92% specificity, 87.27% accuracy, and 0.75% accuracy. We provided an important method for the analysis of PSPD proteins in the proposed research, and our achievements might promote computational biological research. We concluded our proposed model architecture in the future, the use of the latest features, and the multi-one structure to predict different types of molecules, such as DNA, RNA, and disease-pathway specific proteins associations.
引用
收藏
页码:180140 / 180155
页数:16
相关论文
共 49 条
  • [1] ABADI M, 2018, TENSORFLOW LARGE SCA
  • [2] DEEPCON: protein contact prediction using dilated convolutional neural networks with dropout
    Adhikari, Badri
    [J]. BIOINFORMATICS, 2020, 36 (02) : 470 - 477
  • [3] [Anonymous], 2018, Deep Learning for Computer Vision: Expert techniques to train advanced neural networks using TensorFlow and Keras
  • [4] [Anonymous], 2016, BMVC
  • [5] RNA-sequencing analysis revealed genes associated drought stress responses of different durations in hexaploid sweet potato
    Arisha, Mohamed Hamed
    Ahmad, Muhammad Qadir
    Tang, Wei
    Liu, Yaju
    Yan, Hui
    Kou, Meng
    Wang, Xin
    Zhang, Yungang
    Li, Qiang
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [6] Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks
    Babaei, Sepideh
    Geranmayeh, Amir
    Seyyedsalehi, Seyyed Ali
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2010, 100 (03) : 237 - 247
  • [7] SUMOylation of GPS2 protein regulates its transcription-suppressing function
    Bi, Hailian
    Li, Shujing
    Wang, Miao
    Jia, Zhaojun
    Chang, Alan K.
    Pang, Pengsha
    Wu, Huijian
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2014, 25 (16) : 2499 - 2508
  • [8] Mitochondrial Retrograde Signaling in Mammals Is Mediated by the Transcriptional Cofactor GPS2 via Direct Mitochondria-to-Nucleus Translocation
    Cardamone, Maria Dafne
    Tanasa, Bogdan
    Cederquist, Carly T.
    Huang, Jiawen
    Mahdaviani, Kiana
    Li, Wenbo
    Rosenfeld, Michael G.
    Liesa, Marc
    Perissi, Valentina
    [J]. MOLECULAR CELL, 2018, 69 (05) : 757 - +
  • [9] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [10] Homology among (βα)8 barrels:: Implications for the evolution of metabolic pathways
    Copley, RR
    Bork, P
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 303 (04) : 627 - 640