A prediction model of nonclassical secreted protein based on deep learning

被引:0
作者
Zhang, Fan [1 ,2 ]
Liu, Chaoyang [2 ]
Wang, Binjie [1 ]
He, Yiru [3 ]
Zhang, Xinhong [3 ]
机构
[1] Henan Univ, Huaihe Hosp, Radiol Dept, Kaifeng, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Sch Software, Kaifeng 475004, Peoples R China
关键词
bioinformatics; deep learning; nonclassical secreted protein; prediction; WEB SERVER; PLASMA; CLASSIFICATION;
D O I
10.1002/cem.3553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the current nonclassical proteins prediction methods involve manual feature selection, such as constructing features of samples based on the physicochemical properties of proteins and position-specific scoring matrix (PSSM). However, these tasks require researchers to perform some tedious search work to obtain the physicochemical properties of proteins. This paper proposes an end-to-end nonclassical secreted protein prediction model based on deep learning, named DeepNCSPP, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively. Among the experiments conducted on the independent test dataset, DeepNCSPP achieved excellent results with an accuracy of 88.24%, Matthews coefficient (MCC) of 77.01%, and F1-score of 87.50%. Independent test dataset testing and 10-fold cross-validation show that DeepNCSPP achieves competitive performance with state-of-the-art methods and can be used as a reliable nonclassical secreted protein prediction model. A web server has been constructed for the convenience of researchers. The web link is . The source code of DeepNCSPP has been hosted on GitHub and is available online (). This paper proposes an end-to-end nonclassical secreted protein prediction model DeepNCSPP based on deep learning, which employs the protein sequence information and sequence statistics information as input to predict whether it is a nonclassical secreted protein. The protein sequence information and sequence statistics information are extracted using bidirectional long- and short-term memory and convolutional neural networks, respectively.
引用
收藏
页数:12
相关论文
共 47 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The Clinical Plasma Proteome: A Survey of Clinical Assays for Proteins in Plasma and Serum [J].
Anderson, N. Leigh .
CLINICAL CHEMISTRY, 2010, 56 (02) :177-185
[3]  
Antelmann Haike, 2006, Methods Biochem Anal, V49, P179
[4]   DeepLoc: prediction of protein subcellular localization using deep learning [J].
Armenteros, Jose Juan Almagro ;
Sonderby, Casper Kaae ;
Sonderby, Soren Kaae ;
Nielsen, Henrik ;
Winther, Ole .
BIOINFORMATICS, 2017, 33 (21) :3387-3395
[5]   Non-classical protein secretion in bacteria [J].
Bendtsen, JD ;
Kiemer, L ;
Fausboll, A ;
Brunak, S .
BMC MICROBIOLOGY, 2005, 5 (1)
[6]   Feature-based prediction of non-classical and leaderless protein secretion [J].
Bendtsen, JD ;
Jensen, LJ ;
Blom, N ;
von Heijne, G ;
Brunak, S .
PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (04) :349-356
[7]   DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction [J].
Chen, Zhe ;
Zhang, Li ;
Sun, Jianqiang ;
Meng, Rui ;
Yin, Shuaidong ;
Zhao, Qi .
JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2023, 27 (20) :3117-3126
[8]   HMMER web server: interactive sequence similarity searching [J].
Finn, Robert D. ;
Clements, Jody ;
Eddy, Sean R. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W29-W37
[9]   Predicting metabolite-disease associations based on auto-encoder and non-negative matrix factorization [J].
Gao, Hongyan ;
Sun, Jianqiang ;
Wang, Yukun ;
Lu, Yuer ;
Liu, Liyu ;
Zhao, Qi ;
Shuai, Jianwei .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
[10]   Framewise phoneme classification with bidirectional LSTM and other neural network architectures [J].
Graves, A ;
Schmidhuber, J .
NEURAL NETWORKS, 2005, 18 (5-6) :602-610