A prediction model of nonclassical secreted protein based on deep learning

Authors
Zhang, Fan [1 ,2 ]
Liu, Chaoyang [2 ]
Wang, Binjie [1 ]
He, Yiru [3 ]
Zhang, Xinhong [3 ]
Affiliations
[1] Henan Univ, Huaihe Hosp, Radiol Dept, Kaifeng, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Sch Software, Kaifeng 475004, Peoples R China
Keywords
bioinformatics; deep learning; nonclassical secreted protein; prediction; WEB SERVER; PLASMA; CLASSIFICATION;
DOI
10.1002/cem.3553
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Most current nonclassical secreted protein prediction methods rely on manual feature selection, such as constructing sample features from the physicochemical properties of proteins and the position-specific scoring matrix (PSSM). These approaches require researchers to perform tedious searches to obtain the physicochemical properties of proteins. This paper proposes an end-to-end nonclassical secreted protein prediction model based on deep learning, named DeepNCSPP, which takes protein sequence information and sequence statistics information as input to predict whether a protein is a nonclassical secreted protein. The protein sequence information and the sequence statistics information are extracted using bidirectional long short-term memory (BiLSTM) and convolutional neural networks, respectively. In experiments on the independent test dataset, DeepNCSPP achieved an accuracy of 88.24%, a Matthews correlation coefficient (MCC) of 77.01%, and an F1-score of 87.50%. Independent testing and 10-fold cross-validation show that DeepNCSPP achieves performance competitive with state-of-the-art methods and can be used as a reliable nonclassical secreted protein prediction model. A web server has been constructed for the convenience of researchers. The source code of DeepNCSPP is hosted on GitHub and is available online.
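The three evaluation metrics named in the abstract (accuracy, MCC, F1-score) all derive from a binary confusion matrix. The following is a minimal sketch of those standard formulas, not code from DeepNCSPP. The function name `binary_metrics` and the example counts are hypothetical: the abstract does not report a confusion matrix, and the counts below were chosen only because they happen to reproduce the reported percentages.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return accuracy, F1-score and MCC for a binary confusion matrix.

    tp/fp/tn/fn are counts of true positives, false positives,
    true negatives and false negatives.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total

    # Precision and recall, guarding against empty denominators.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )

    # Matthews correlation coefficient; defined as 0 when any marginal is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {"accuracy": accuracy, "f1": f1, "mcc": mcc}


# Hypothetical counts (NOT taken from the paper), picked for illustration.
metrics = binary_metrics(tp=14, fp=1, tn=16, fn=3)
for name, value in metrics.items():
    print(f"{name}: {value * 100:.2f}%")
```

MCC ranges from -1 to 1, so reporting it as a percentage, as the abstract does, corresponds to multiplying the coefficient by 100.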
Pages: 12
Related papers
50 in total
  • [1] Protein subcellular and secreted localization prediction using deep learning
    Zidoum, Hamza
    Magdy, Mennatollah
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018
  • [2] ASPIRER: a new computational approach for identifying non-classical secreted proteins based on deep learning
    Wang, Xiaoyu
    Li, Fuyi
    Xu, Jing
    Rong, Jia
    Webb, Geoffrey I.
    Ge, Zongyuan
    Li, Jian
    Song, Jiangning
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [3] Prediction of Protein-DNA Binding Sites Based on Protein Language Model and Deep Learning
    Shan, Kaixuan
    Zhang, Xiankun
    Song, Chen
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 314 - 325
  • [4] MultiSec: Multi-Task Deep Learning Improves Secreted Protein Discovery in Human Body Fluids
    He, Kai
    Wang, Yan
    Xie, Xuping
    Shao, Dan
    MATHEMATICS, 2022, 10 (15)
  • [5] Deep Learning Based Prediction Model for the Next Purchase
    Utku, Anil
    Akcayol, Muhammet Ali
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2020, 20 (02) : 35 - 44
  • [6] Deep-ProBind: binding protein prediction with transformer-based deep learning model
    Khan, Salman
    Noor, Sumaiya
    Awan, Hamid Hussain
    Iqbal, Shehryar
    Alqahtani, Salman A.
    Dilshad, Naqqash
    Ahmad, Nijad
    BMC BIOINFORMATICS, 2025, 26 (01)
  • [7] A Unified Deep Learning Model for Protein Structure Prediction
    Bai, Lin
    Yang, Lina
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2017 : 248 - 253
  • [8] A Comprehensive Survey of Deep Learning Techniques in Protein Function Prediction
    Dhanuka, Richa
    Singh, Jyoti Prakash
    Tripathi, Anushree
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 2291 - 2301
  • [9] Deep learning based prediction of species-specific protein S-glutathionylation sites
    Li, Shihua
    Yu, Kai
    Wang, Dawei
    Zhang, Qingfeng
    Liu, Ze-Xian
    Zhao, Linhong
    Cheng, Han
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2020, 1868 (07)
  • [10] Protein Secondary Structure Prediction Based on Deep Learning
    Zheng, Lin
    Li, Hong-ling
    Wu, Nan
    Ao, Li
    3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS (ISMII 2017), 2017 : 171 - 177