A prediction model of nonclassical secreted protein based on deep learning

被引:0
|
作者
Zhang, Fan [1 ,2 ]
Liu, Chaoyang [2 ]
Wang, Binjie [1 ]
He, Yiru [3 ]
Zhang, Xinhong [3 ]
机构
[1] Henan Univ, Huaihe Hosp, Radiol Dept, Kaifeng, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Sch Software, Kaifeng 475004, Peoples R China
关键词
bioinformatics; deep learning; nonclassical secreted protein; prediction; WEB SERVER; PLASMA; CLASSIFICATION;
D O I
10.1002/cem.3553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the current nonclassical proteins prediction methods involve manual feature selection, such as constructing features of samples based on the physicochemical properties of proteins and position-specific scoring matrix (PSSM). However, these tasks require researchers to perform some tedious search work to obtain the physicochemical properties of proteins. This paper proposes an end-to-end nonclassical secreted protein prediction model based on deep learning, named DeepNCSPP, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively. Among the experiments conducted on the independent test dataset, DeepNCSPP achieved excellent results with an accuracy of 88.24%, Matthews coefficient (MCC) of 77.01%, and F1-score of 87.50%. Independent test dataset testing and 10-fold cross-validation show that DeepNCSPP achieves competitive performance with state-of-the-art methods and can be used as a reliable nonclassical secreted protein prediction model. A web server has been constructed for the convenience of researchers. The web link is . The source code of DeepNCSPP has been hosted on GitHub and is available online (). This paper proposes an end-to-end nonclassical secreted protein prediction model DeepNCSPP based on deep learning, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] miGAP: miRNA-Gene Association Prediction Method Based on Deep Learning Model
    Yoon, Seungwon
    Hwang, Inwoo
    Cho, Jaeeun
    Yoon, Hyewon
    Lee, Kyuchul
    Minkiewicz, Piotr
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [42] PYTHIA: Deep Learning Approach for Local Protein Conformation Prediction
    Cretin, Gabriel
    Galochkina, Tatiana
    de Brevern, Alexandre G.
    Gelly, Jean-Christophe
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (16)
  • [43] DNA-binding protein prediction based on deep transfer learning
    Yan, Jun
    Jiang, Tengsheng
    Liu, Junkai
    Lu, Yaoyao
    Guan, Shixuan
    Li, Haiou
    Wu, Hongjie
    Ding, Yijie
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (08) : 7719 - 7736
  • [44] Recent developments in deep learning applied to protein structure prediction
    Kandathil, Shaun M.
    Greener, Joe G.
    Jones, David T.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) : 1179 - 1189
  • [45] Prediction of human protein subcellular localization using deep learning
    Wei, Leyi
    Ding, Yijie
    Su, Ran
    Tang, Jijun
    Zou, Quan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 212 - 217
  • [46] DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction
    Niraj Thapa
    Meenal Chaudhari
    Sean McManus
    Kaushik Roy
    Robert H. Newman
    Hiroto Saigo
    Dukka B. KC
    BMC Bioinformatics, 21
  • [47] Protein-Ligand Binding Affinity Prediction Based on Deep Learning
    Lu, Yaoyao
    Liu, Junkai
    Jiang, Tengsheng
    Guan, Shixuan
    Wu, Hongjie
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 310 - 316
  • [48] DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction
    Thapa, Niraj
    Chaudhari, Meenal
    McManus, Sean
    Roy, Kaushik
    Newman, Robert H.
    Saigo, Hiroto
    KC, Dukka B.
    BMC BIOINFORMATICS, 2020, 21 (Suppl 3)
  • [49] Protein deep profile and model predictions for identifying the causal genes of male infertility based on deep learning
    Xu, Fang
    Guo, Ganggang
    Zhu, Feida
    Tan, Xiaojun
    Fan, Liqing
    INFORMATION FUSION, 2021, 75 : 70 - 89
  • [50] Enhancing resiliency feature in smart grids through a deep learning based prediction model
    Khediri A.
    Laouar M.R.
    Eom S.B.
    Recent Advances in Computer Science and Communications, 2020, 13 (03) : 508 - 518