ADP-Fuse: A novel two-layer machine learning predictor to identify antidiabetic peptides and diabetes types using multiview information

Cited by: 8
Authors
Basith, Shaherin [1]
Pham, Nhat Truong [2]
Song, Minkyung [2,3]
Lee, Gwang [1,4]
Manavalan, Balachandran [2]
Affiliations
[1] Ajou Univ, Sch Med, Dept Physiol, Suwon 16499, South Korea
[2] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Suwon 16419, South Korea
[3] Sungkyunkwan Univ, Dept Biopharmaceut Convergence, Suwon 16419, South Korea
[4] Ajou Univ, Dept Mol Sci & Technol, Suwon 16499, South Korea
Funding
National Research Foundation of Singapore
Keywords
Antidiabetic peptides; Sequence analysis; Bioinformatics; Multiview information; Machine learning; Stacking ensemble learning; Web server
DOI
10.1016/j.compbiomed.2023.107386
Chinese Library Classification number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
Diabetes mellitus has become a major public health concern associated with high mortality and reduced life expectancy, and it can cause blindness, heart attacks, kidney failure, lower limb amputations, and strokes. A new generation of antidiabetic peptides (ADPs) that act on beta-cells or T-cells to regulate insulin production is being developed to alleviate the effects of diabetes. However, the lack of effective peptide-mining tools has hampered the discovery of these promising drugs. Hence, novel computational tools need to be developed urgently. In this study, we present ADP-Fuse, a novel two-layer prediction framework capable of accurately distinguishing ADPs from non-ADPs and categorizing ADPs into type 1 and type 2. First, we comprehensively evaluated 22 peptide sequence-derived features coupled with eight notable machine learning algorithms. Subsequently, the most suitable feature descriptors and classifiers for both layers were identified. The outputs of these single-feature models, which embed multiview information, were used to train an appropriate classifier that provides the final prediction. Comprehensive cross-validation and independent tests substantiate that ADP-Fuse surpasses single-feature models and the feature fusion approach for the prediction of ADPs and their types. In addition, the SHapley Additive exPlanation method was used to elucidate the contributions of individual features to the prediction of ADPs and their types. Finally, a user-friendly web server for ADP-Fuse was developed and made publicly accessible (https://balalab-skku.org/ADP-Fuse), enabling the swift screening and identification of novel ADPs and their types. This framework is expected to contribute significantly to antidiabetic peptide identification.
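The two-layer stacking idea in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the ADP-Fuse implementation: the two feature views, the centroid-based base learner, the logistic meta-learner, and every peptide sequence and label below are hypothetical stand-ins, since the abstract does not name the 22 descriptors or the eight algorithms. For brevity the meta-learner is trained on in-sample base scores, whereas a careful stacking setup would use out-of-fold scores.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Coarse residue groups: an illustrative choice, not the paper's descriptors.
GROUPS = {"hydrophobic": "AVLIMFWC", "polar": "STNQYGP", "charged": "DEKRH"}

def aac(seq):
    """View 1: amino acid composition (20 fractions)."""
    counts = Counter(seq)
    return [counts[a] / len(seq) for a in AMINO_ACIDS]

def group_comp(seq):
    """View 2: fraction of residues in each coarse group."""
    return [sum(seq.count(a) for a in m) / len(seq) for m in GROUPS.values()]

class CentroidModel:
    """Toy single-view base learner; score near 1 means 'closer to class 1'."""
    def __init__(self, encode):
        self.encode = encode
    def fit(self, seqs, labels):
        self.centroids = {}
        for label in (0, 1):
            rows = [self.encode(s) for s, y in zip(seqs, labels) if y == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self
    def score(self, seq):
        x = self.encode(seq)
        d0 = math.dist(x, self.centroids[0])
        d1 = math.dist(x, self.centroids[1])
        return 0.5 if d0 + d1 == 0 else d0 / (d0 + d1)

class LogisticMeta:
    """Meta-learner: logistic regression fitted by plain SGD."""
    def fit(self, rows, labels, lr=0.5, epochs=500):
        self.w, self.b = [0.0] * len(rows[0]), 0.0
        for _ in range(epochs):
            for x, y in zip(rows, labels):
                err = self.predict_proba(x) - y
                self.w = [w - lr * err * xi for w, xi in zip(self.w, x)]
                self.b -= lr * err
        return self
    def predict_proba(self, x):
        z = self.b + sum(w * xi for w, xi in zip(self.w, x))
        return 1.0 / (1.0 + math.exp(-z))

class StackedLayer:
    """One layer: per-view base scores feed a trained meta-learner."""
    def __init__(self, encoders):
        self.bases = [CentroidModel(e) for e in encoders]
        self.meta = LogisticMeta()
    def fit(self, seqs, labels):
        for base in self.bases:
            base.fit(seqs, labels)
        # Simplification: in-sample scores instead of out-of-fold scores.
        self.meta.fit([[b.score(s) for b in self.bases] for s in seqs], labels)
        return self
    def predict(self, seq):
        return int(self.meta.predict_proba([b.score(seq) for b in self.bases]) >= 0.5)

def predict_two_layer(layer1, layer2, seq):
    """Layer 1: ADP vs non-ADP. Layer 2 (ADPs only): type 1 vs type 2."""
    if layer1.predict(seq) == 0:
        return "non-ADP"
    return "type 1 ADP" if layer2.predict(seq) == 0 else "type 2 ADP"

# Made-up sequences with arbitrary labels; they carry no biological meaning.
type1 = ["KKRRKKRR", "RKRKRKRK"]
type2 = ["DDEEDDEE", "EDEDEDED"]
non_adp = ["AAVVLLII", "LLIIVVAA", "AVLIAVLI", "IVLAIVLA"]

views = [aac, group_comp]
layer1 = StackedLayer(views).fit(type1 + type2 + non_adp, [1] * 4 + [0] * 4)
layer2 = StackedLayer(views).fit(type1 + type2, [0] * 2 + [1] * 2)

for peptide in ["RRRRKKKK", "EEEEDDDD", "VVAALLII"]:
    print(peptide, "->", predict_two_layer(layer1, layer2, peptide))
```

In this toy data the group-composition view cannot separate the two hypothetical ADP types (all residues are charged), so the second-layer meta-learner ends up relying on the composition view. That is the kind of complementarity between views that stacking is meant to exploit.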
Pages: 8