MultiFeatVotPIP: a voting-based ensemble learning framework for predicting proinflammatory peptides

被引:3
作者
Yan, Chaorui [1 ]
Geng, Aoyun [1 ]
Pan, Zhuoyu [2 ]
Zhang, Zilong [1 ]
Cui, Feifei [1 ]
机构
[1] Hainan Univ, Sch Comp Sci & Technol, 58 Renmin Ave,Haidian Campus, Haikou 570228, Peoples R China
[2] Hainan Univ, Int Business Sch, 58 Renmin Ave,Haidian Campus, Haikou 570228, Peoples R China
基金
中国国家自然科学基金;
关键词
proinflammatory peptide; inflammation; feature encoding; machine learning; ensemble learning; CYTOKINES; INFLAMMATION; REPRESENTATION;
D O I
10.1093/bib/bbae505
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Inflammatory responses may lead to tissue or organ damage, and proinflammatory peptides (PIPs) are signaling peptides that can induce such responses. Many diseases have been redefined as inflammatory diseases. To identify PIPs more efficiently, we expanded the dataset and designed an ensemble learning model with manually encoded features. Specifically, we adopted a more comprehensive feature encoding method and considered the actual impact of certain features to filter them. Identification and prediction of PIPs were performed using an ensemble learning model based on five different classifiers. The results show that the model's sensitivity, specificity, accuracy, and Matthews correlation coefficient are all higher than those of the state-of-the-art models. We named this model MultiFeatVotPIP, and both the model and the data can be accessed publicly at https://github.com/ChaoruiYan019/MultiFeatVotPIP. Additionally, we have developed a user-friendly web interface for users, which can be accessed at http://www.bioai-lab.com/MultiFeatVotPIP.
引用
收藏
页数:11
相关论文
共 52 条
  • [1] XGBoost framework with feature selection for the prediction of RNA N5-methylcytosine sites
    Abbas, Zeeshan
    Rehman, Mobeen ur
    Tayara, Hilal
    Zou, Quan
    Chong, Kil To
    [J]. MOLECULAR THERAPY, 2023, 31 (08) : 2543 - 2551
  • [2] [Anonymous], Proc. SSST@EMNLP 2014, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha, Qatar, 25 October 2014
  • [3] [Anonymous], Proceedings of the 2021 3rd World Symposium on Artificial Intelligence (WSAI), 1820 June 2021, Dalian, China
  • [4] Atallah R, 2019, 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), P335
  • [5] de Oliveira CMB, 2011, REV BRAS ANESTESIOL, V61, P255, DOI 10.1016/S0034-7094(11)70029-0
  • [6] SIGNAL DETECTABILITY - THE USE OF ROC CURVES AND THEIR ANALYSES
    CENTOR, RM
    [J]. MEDICAL DECISION MAKING, 1991, 11 (02) : 102 - 106
  • [7] DeepM6ASeq-EL: prediction of human N6-methyladenosine (m6A) sites with LSTM and ensemble learning
    Chen, Juntao
    Zou, Quan
    Li, Jing
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (02)
  • [8] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [9] iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization
    Chen, Zhen
    Zhao, Pei
    Li, Chen
    Li, Fuyi
    Xiang, Dongxu
    Chen, Yong-Zi
    Akutsu, Tatsuya
    Daly, Roger J.
    Webb, Geoffrey, I
    Zhao, Quanzhi
    Kurgan, Lukasz
    Song, Jiangning
    [J]. NUCLEIC ACIDS RESEARCH, 2021, 49 (10)
  • [10] iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data
    Chen, Zhen
    Zhao, Pei
    Li, Fuyi
    Marquez-Lago, Tatiana T.
    Leier, Andre
    Revote, Jerico
    Zhu, Yan
    Powell, David R.
    Akutsu, Tatsuya
    Webb, Geoffrey, I
    Chou, Kuo-Chen
    Smith, A. Ian
    Daly, Roger J.
    Li, Jian
    Song, Jiangning
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (03) : 1047 - 1057