EnsembleNPPred: A Robust Approach to Neuropeptide Prediction and Recognition Using Ensemble Machine Learning and Deep Learning Methods

被引:0
作者
Lertampaiporn, Supatcha [1 ]
Wattanapornprom, Warin [2 ]
Thammarongtham, Chinae [1 ]
Hongsthong, Apiradee [1 ]
机构
[1] King Mongkuts Univ Technol Thonburi, Natl Sci & Technol Dev Agcy, Natl Ctr Genet Engn & Biotechnol, Biochem Engn & Syst Biol Res Grp, Bangkok 10150, Thailand
[2] King Mongkuts Univ Technol Thonburi, Fac Sci, Dept Math, Appl Comp Sci Program, Bangkok 10150, Thailand
来源
LIFE-BASEL | 2025年 / 15卷 / 07期
关键词
neuropeptide prediction; prediction model; bioinformatics; ensemble learning; machine learning; combining methods; computational peptide discovery; deep learning; voting; AMINO-ACIDS; R PACKAGE; PEPTIDES; PROTEIN; DESCRIPTORS; SCALE; PROHORMONE; SUBFAMILY; SET;
D O I
10.3390/life15071010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Neuropeptides (NPs) are a diverse group of signaling molecules involved in regulating key physiological processes such as pain perception, stress response, mood, appetite, and circadian rhythms. Acting as neurotransmitters, neuromodulators, or neurohormones, they play a critical role in modulating and fine-tuning neural signaling networks. Despite their biological significance, identifying NPs through experimental techniques remains time-consuming and resource-intensive. To support this effort, computational prediction tools have emerged as a cost-effective approach for prioritizing candidate sequences for experimental validation. In this study, we propose EnsembleNPPred, an ensemble learning framework that integrates traditional machine learning (ML) models with a deep learning (DL) component. By combining the complementary strengths of these approaches, the model aims to improve generalization and predictive robustness. EnsembleNPPred employs a majority voting mechanism to aggregate the outputs from three classifiers: Support Vector Machine (SVM), Extra Trees (ET), and a CNN-based DL model. When evaluated on independent datasets, EnsembleNPPred demonstrated consistently competitive performance, achieving improvements in both accuracy and sensitivity-specificity balance compared to several existing methods. Furthermore, testing on multiple neuropeptide families from the NeuroPep database yielded an average accuracy of 91.92%, suggesting the model's potential to generalize across diverse peptide classes. These results suggest that EnsembleNPPred may be a useful tool for early-stage neuropeptide candidate identification and for supporting downstream experimental validation.
引用
收藏
页数:19
相关论文
共 76 条
[1]   NeuroPIpred: a tool to predict, design and scan insect neuropeptides [J].
Agrawal, Piyush ;
Kumar, Sumit ;
Singh, Archana ;
Raghava, Gajendra P. S. ;
Singh, Indrakant K. .
SCIENTIFIC REPORTS, 2019, 9 (1)
[2]   Deep Dive into Fake News Detection: Feature-Centric Classification with Ensemble and Deep Learning Methods [J].
Alarfaj, Fawaz Khaled ;
Khan, Jawad Abbas .
ALGORITHMS, 2023, 16 (11)
[3]   A new subfamily of penaeidin with an additional serine-rich region from kuruma shrimp (Marsupenaeus japonicus) contributes to antimicrobial and phagocytic activities [J].
An, Ming-Yu ;
Gao, Jie ;
Zhao, Xiao-Fan ;
Wang, Jin-Xing .
DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY, 2016, 59 :186-198
[4]   Neuropeptides and Behaviors: How Small Peptides Regulate Nervous System Function and Behavioral Outputs [J].
Bhat, Umer Saleem ;
Shahi, Navneet ;
Surendran, Siju ;
Babu, Kavita .
FRONTIERS IN MOLECULAR NEUROSCIENCE, 2021, 14
[5]   Prediction of Neuropeptides from Sequence Information Using Ensemble Classifier and Hybrid Features [J].
Bin, Yannan ;
Zhang, Wei ;
Tang, Wending ;
Dai, Ruyu ;
Li, Menglu ;
Zhu, Qizhi ;
Xia, Junfeng .
JOURNAL OF PROTEOME RESEARCH, 2020, 19 (09) :3732-3740
[6]   Chemokine Heteromers and Their Impact on Cellular Function-A Conceptual Framework [J].
Blanchet, Xavier ;
Weber, Christian ;
von Hundelshausen, Philipp .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (13)
[7]   Antibacterial peptides: basic facts and emerging concepts [J].
Boman, HG .
JOURNAL OF INTERNAL MEDICINE, 2003, 254 (03) :197-215
[8]   Software Fault Prediction Using an RNN-Based Deep Learning Approach and Ensemble Machine Learning Techniques [J].
Borandag, Emin .
APPLIED SCIENCES-BASEL, 2023, 13 (03)
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
Burbach J., 2011, Biol, V789, P1, DOI [10.1007/978-1-61779-310-3_1, DOI 10.1007/978-1-61779-310-3_1]