ADP-Fuse: A novel two-layer machine learning predictor to identify antidiabetic peptides and diabetes types using multiview information

被引:16
作者
Basith, Shaherin [1 ]
Pham, Nhat Truong [2 ]
Song, Minkyung [2 ,3 ]
Lee, Gwang [1 ,4 ]
Manavalan, Balachandran [2 ]
机构
[1] Ajou Univ, Sch Med, Dept Physiol, Suwon 16499, South Korea
[2] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Suwon 16419, South Korea
[3] Sungkyunkwan Univ, Dept Biopharmaceut Convergence, Suwon 16419, South Korea
[4] Ajou Univ, Dept Mol Sci & Technol, Suwon 16499, South Korea
基金
新加坡国家研究基金会;
关键词
Antidiabetic peptides; Sequence analysis; Bioinformatics; Multiview information; Machine learning; Stacking ensemble learning; WEB SERVER;
D O I
10.1016/j.compbiomed.2023.107386
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Diabetes mellitus has become a major public health concern associated with high mortality and reduced life expectancy and can cause blindness, heart attacks, kidney failure, lower limb amputations, and strokes. A new generation of antidiabetic peptides (ADPs) that act on beta-cells or T-cells to regulate insulin production is being developed to alleviate the effects of diabetes. However, the lack of effective peptide-mining tools has hampered the discovery of these promising drugs. Hence, novel computational tools need to be developed urgently. In this study, we present ADP-Fuse, a novel two-layer prediction framework capable of accurately identifying ADPs or non-ADPs and categorizing them into type 1 and type 2 ADPs. First, we comprehensively evaluated 22 peptide sequence-derived features coupled with eight notable machine learning algorithms. Subsequently, the most suitable feature descriptors and classifiers for both layers were identified. The output of these single-feature models, embedded with multiview information, was trained with an appropriate classifier to provide the final prediction. Comprehensive cross-validation and independent tests substantiate that ADP-Fuse surpasses singlefeature models and the feature fusion approach for the prediction of ADPs and their types. In addition, the SHapley Additive exPlanation method was used to elucidate the contributions of individual features to the prediction of ADPs and their types. Finally, a user-friendly web server for ADP-Fuse was developed and made publicly accessible (https://balalab-skku.org/ADP-Fuse), enabling the swift screening and identification of novel ADPs and their types. This framework is expected to contribute significantly to antidiabetic peptide identification.
引用
收藏
页数:8
相关论文
共 54 条
[1]   AntiCP 2.0: an updated model for predicting anticancer peptides [J].
Agrawal, Piyush ;
Bhagat, Dhruv ;
Mahalwal, Manish ;
Sharma, Neelam ;
Raghava, Gajendra P. S. .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
[2]   m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation [J].
Ao, Chunyan ;
Ye, Xiucai ;
Sakurai, Tetsuya ;
Zou, Quan ;
Yu, Liang .
BMC BIOLOGY, 2023, 21 (01)
[3]   STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction [J].
Basith, Shaherin ;
Lee, Gwang ;
Manavalan, Balachandran .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
[4]   Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening [J].
Basith, Shaherin ;
Manavalan, Balachandran ;
Shin, Tae Hwan ;
Lee, Gwang .
MEDICINAL RESEARCH REVIEWS, 2020, 40 (04) :1276-1314
[5]   SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome [J].
Basith, Shaherin ;
Manavalan, Balachandran ;
Shin, Tae Hwan ;
Lee, Gwang .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2019, 18 :131-141
[6]   ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species [J].
Chen, Ruyi ;
Li, Fuyi ;
Guo, Xudong ;
Bi, Yue ;
Li, Chen ;
Pan, Shirui ;
Coin, Lachlan J. M. ;
Song, Jiangning .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
[7]   AntiDMPpred: a web service for identifying anti-diabetic peptides [J].
Chen, Xue ;
Huang, Jian ;
He, Bifang .
PEERJ, 2022, 10
[8]   iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets [J].
Chen, Zhen ;
Liu, Xuhan ;
Zhao, Pei ;
Li, Chen ;
Wang, Yanan ;
Li, Fuyi ;
Akutsu, Tatsuya ;
Bain, Chris ;
Gasser, Robin B. ;
Li, Junzhou ;
Yang, Zuoren ;
Gao, Xin ;
Kurgan, Lukasz ;
Song, Jiangning .
NUCLEIC ACIDS RESEARCH, 2022, 50 (W1) :W434-W447
[9]   iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Marquez-Lago, Tatiana T. ;
Leier, Andre ;
Revote, Jerico ;
Zhu, Yan ;
Powell, David R. ;
Akutsu, Tatsuya ;
Webb, Geoffrey, I ;
Chou, Kuo-Chen ;
Smith, A. Ian ;
Daly, Roger J. ;
Li, Jian ;
Song, Jiangning .
BRIEFINGS IN BIOINFORMATICS, 2020, 21 (03) :1047-1057
[10]   iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Leier, Andre ;
Marquez-Lago, Tatiana T. ;
Wang, Yanan ;
Webb, Geoffrey I. ;
Smith, A. Ian ;
Daly, Roger J. ;
Chou, Kuo-Chen ;
Song, Jiangning .
BIOINFORMATICS, 2018, 34 (14) :2499-2502