iDPPIV-SI: identifying dipeptidyl peptidase IV inhibitory peptides by using multiple sequence information

被引:0
作者
Zou, Hongliang [1 ]
机构
[1] Jiangxi Sci & Technol Normal Univ, Sch Commun & Elect, Nanchang, Peoples R China
关键词
Dipeptidyl peptidase IV inhibitory peptides; physicochemical properties; correlation methods; discrete wavelet transform; support vector machine; ACCURATE PREDICTION; IDENTIFICATION; NETWORKS; PROTEINS; TRIPEPTIDES; CLASSIFIER; ALGORITHM; INTERNET; MODEL; SVM;
D O I
10.1080/07391102.2023.2203257
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Currently, diabetes has become a great threaten for people's health in the world. Recent study shows that dipeptidyl peptidase IV (DPP-IV) inhibitory peptides may be a potential pharmaceutical agent to treat diabetes. Thus, there is a need to discriminate DPP-IV inhibitory peptides from non-DPP-IV inhibi-tory peptides. To address this issue, a novel computational model called iDPPIV-SI was developed in this study. In the first, 50 different types of physicochemical (PC) properties were employed to denote the peptide sequences. Three different feature descriptors including the 1-order, 2-order correlation methods and discrete wavelet transform were applied to collect useful information from the PC matrix. Furthermore, the least absolute shrinkage and selection operator (LASSO) algorithm was employed to select these most discriminative features. All of these chosen features were fed into support vector machine (SVM) for identifying DPP-IV inhibitory peptides. The iDPPIV-SI achieved 91.26% and 98.12% classification accuracies on the training and independent dataset, respectively. There is a significantly improvement in the classification performance by the proposed method, as compared with the state-of-the-art predictors. The datasets and MATLAB codes (based on MATLAB2015b) used in current study are available at https://figshare.com/articles/online_resource/iDPPIV-SI/20085878.
引用
收藏
页码:2144 / 2152
页数:9
相关论文
共 71 条
  • [11] iTTCA-Hybrid: Improved and robust identification of tumor T cell antigens by utilizing hybrid feature representation
    Charoenkwan, Phasit
    Nantasenamat, Chanin
    Hasan, Md Mehedi
    Shoombuatong, Watshara
    [J]. ANALYTICAL BIOCHEMISTRY, 2020, 599
  • [12] PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
    Charoenkwan, Phasit
    Kanthawong, Sakawrat
    Schaduangrat, Nalini
    Yana, Janchai
    Shoombuatong, Watshara
    [J]. CELLS, 2020, 9 (02)
  • [13] Chatterjee I., 2021, International Journal of Modern Research, V1, P15
  • [14] Multivariate classification of autism spectrum disorder using frequency-specific resting-state functional connectivity-A multi-center study
    Chen, Heng
    Duan, Xujun
    Liu, Feng
    Lu, Fengmei
    Ma, Xujing
    Zhang, Youxue
    Uddin, Lucina Q.
    Chen, Huafu
    [J]. PROGRESS IN NEURO-PSYCHOPHARMACOLOGY & BIOLOGICAL PSYCHIATRY, 2016, 64 : 1 - 9
  • [15] iATP: A Sequence Based Method for identifying Anti-tubercular Peptides
    Chen, Wei
    Feng, Pengmian
    Nie, Fulei
    [J]. MEDICINAL CHEMISTRY, 2020, 16 (05) : 620 - 625
  • [16] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [17] UbiSitePred: A novel method for improving the accuracy of ubiquitination sites prediction by using LASSO to select the optimal Chou's pseudo components
    Cui, Xiaowen
    Yu, Zhaomin
    Yu, Bin
    Wang, Minghui
    Tian, Baoguang
    Ma, Qin
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 : 28 - 43
  • [18] BBPpred: Sequence-Based Prediction of Blood-Brain Barrier Peptides with Feature Representation Learning and Logistic Regression
    Dai, Ruyu
    Zhang, Wei
    Tang, Wending
    Wynendaele, Evelien
    Zhu, Qizhi
    Bin, Yannan
    De Spiegeleer, Bart
    Xia, Junfeng
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (01) : 525 - 534
  • [19] Computer-aided prediction and design of IL-6 inducing peptides: IL-6 plays a crucial role in COVID-19
    Dhall, Anjali
    Patiyal, Sumeet
    Sharma, Neelam
    Usmani, Salman Sadullah
    Raghava, Gajendra P. S.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (02) : 936 - 945
  • [20] Human protein subcellular localization identification via fuzzy model on Kernelized Neighborhood Representation
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    [J]. APPLIED SOFT COMPUTING, 2020, 96