A prediction model of nonclassical secreted protein based on deep learning

被引:0
|
作者
Zhang, Fan [1 ,2 ]
Liu, Chaoyang [2 ]
Wang, Binjie [1 ]
He, Yiru [3 ]
Zhang, Xinhong [3 ]
机构
[1] Henan Univ, Huaihe Hosp, Radiol Dept, Kaifeng, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Sch Software, Kaifeng 475004, Peoples R China
关键词
bioinformatics; deep learning; nonclassical secreted protein; prediction; WEB SERVER; PLASMA; CLASSIFICATION;
D O I
10.1002/cem.3553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the current nonclassical proteins prediction methods involve manual feature selection, such as constructing features of samples based on the physicochemical properties of proteins and position-specific scoring matrix (PSSM). However, these tasks require researchers to perform some tedious search work to obtain the physicochemical properties of proteins. This paper proposes an end-to-end nonclassical secreted protein prediction model based on deep learning, named DeepNCSPP, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively. Among the experiments conducted on the independent test dataset, DeepNCSPP achieved excellent results with an accuracy of 88.24%, Matthews coefficient (MCC) of 77.01%, and F1-score of 87.50%. Independent test dataset testing and 10-fold cross-validation show that DeepNCSPP achieves competitive performance with state-of-the-art methods and can be used as a reliable nonclassical secreted protein prediction model. A web server has been constructed for the convenience of researchers. The web link is . The source code of DeepNCSPP has been hosted on GitHub and is available online (). This paper proposes an end-to-end nonclassical secreted protein prediction model DeepNCSPP based on deep learning, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] An efficient hybrid weather prediction model based on deep learning
    Utku, A.
    Can, U.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2023, 20 (10) : 11107 - 11120
  • [32] Deep learning based software defect prediction
    Qiao, Lei
    Li, Xuesong
    Umer, Qasim
    Guo, Ping
    NEUROCOMPUTING, 2020, 385 : 100 - 110
  • [33] An efficient hybrid weather prediction model based on deep learning
    A. Utku
    U. Can
    International Journal of Environmental Science and Technology, 2023, 20 : 11107 - 11120
  • [34] Deep Learning-Based Model for Financial Distress Prediction
    Elhoseny, Mohamed
    Metawa, Noura
    Sztano, Gabor
    El-hasnony, Ibrahim M.
    ANNALS OF OPERATIONS RESEARCH, 2025, 345 (2-3) : 885 - 907
  • [35] Blood cancer prediction model based on deep learning technique
    Shehta, Amr I.
    Nasr, Mona
    El Ghazali, Alaa El Din M.
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [36] Deep Learning Model for Prediction of Diffusion in Defect Substances
    AlArfaj, Abeer Abdulaziz
    Mahmoud, Hanan Ahmed Hosni
    PROCESSES, 2022, 10 (08)
  • [37] Down Syndrome Prediction/Screening Model Based on Deep Learning and Illumina Genotyping Array
    Feng, Bing
    Hoskins, William
    Zhang, Yan
    Meng, Zibo
    Samuels, David C.
    Guo, Yan
    Tang, Jijun
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 347 - 352
  • [38] A novel deep learning model based on convolutional neural networks for employee churn prediction
    Ozmen, Ebru Pekel
    Ozcan, Tuncay
    JOURNAL OF FORECASTING, 2022, 41 (03) : 539 - 550
  • [39] NCSP-PLM: An ensemble learning framework for predicting non- classical secreted proteins based on protein language models and deep learning
    Liu, Taigang
    Song, Chen
    Wang, Chunhua
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1472 - 1488
  • [40] Deep Learning for Enhancing Diabetes Prediction
    Naz, Uzma
    Khalil, Ashraf
    Khattak, Asad
    Ali Raza, Muhammad
    Asghar, Junaid
    Asghar, Muhammad Zubair
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,