IF-AIP: A machine learning method for the identification of anti-inflammatory peptides using multi-feature fusion strategy

Citations: 24
Authors
Gaffar, Saima [1]
Hassan, Mir Tanveerul [1]
Tayara, Hilal [2]
Chong, Kil To [1, 3]
Affiliations
[1] Jeonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[2] Jeonbuk Natl Univ, Sch Int Engn & Sci, Jeonju 54896, South Korea
[3] Jeonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Anti-inflammatory peptides; Machine learning; Voting classifier; Bioinformatics; AMINO-ACID-COMPOSITION; INFLAMMATION; MECHANISMS;
DOI
10.1016/j.compbiomed.2023.107724
Chinese Library Classification number
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Background: The most commonly used therapy for inflammatory and autoimmune diseases is currently non-specific anti-inflammatory drugs, which have various hazardous side effects. Recently, some anti-inflammatory peptides (AIPs) have been found to be a substitute therapy for inflammatory diseases such as rheumatoid arthritis and Alzheimer's. The identification of these AIPs is therefore an important emerging topic.
Methods: In this work, we propose an identification model for AIPs based on a voting classifier. We used eight different feature descriptors and five conventional machine-learning classifiers. The eight feature encodings were concatenated to obtain a hybrid feature set. The five baseline models trained on the hybrid feature set were integrated via a voting classifier. Finally, a feature selection algorithm was used to select the optimal feature set for the construction of our final model, named IF-AIP.
Results: We tested the proposed model on two independent datasets. On independent dataset 1, the IF-AIP model shows an improvement of 3%-5.6% in accuracy and 6.7%-10.8% in MCC compared to the existing methods. On independent dataset 2, IF-AIP shows an overall improvement of 2.9%-5.7% in accuracy and 8.3%-8.6% in MCC compared to the existing methods. A comparative performance analysis between the proposed model and existing methods was also conducted on a set of 24 novel peptide sequences. Notably, IF-AIP correctly identified all 24 peptides as AIPs. The source code, pre-trained models, and all datasets are available at https://github.com/Mir-Saima/IF-AIP.
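The fusion-and-voting idea in the Methods paragraph can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the eight descriptors, the five classifiers, or the voting scheme, so the two encodings used here (amino acid composition and dipeptide composition), the soft-voting rule, and every function name are assumptions chosen for illustration. The feature selection step is omitted. The MCC helper uses the standard Matthews correlation coefficient formula, the metric reported in the Results.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def aac(seq):
    """Amino acid composition: fraction of each of the 20 residues (assumed descriptor)."""
    n = len(seq)
    return [seq.count(a) / n for a in AMINO_ACIDS]


def dpc(seq):
    """Dipeptide composition: fraction of each of the 400 residue pairs (assumed descriptor)."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = max(len(pairs), 1)  # guard against single-residue input
    return [pairs.count(a + b) / total for a, b in product(AMINO_ACIDS, repeat=2)]


def hybrid_features(seq):
    """Feature fusion by concatenating the individual encodings into one vector."""
    return aac(seq) + dpc(seq)


def soft_vote(probabilities, threshold=0.5):
    """Average the positive-class probabilities of the baseline models.

    Soft voting is an assumption; the abstract only says 'voting classifier'.
    Returns 1 (AIP) if the mean probability reaches the threshold, else 0.
    """
    mean = sum(probabilities) / len(probabilities)
    return int(mean >= threshold)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom
```

In this sketch, `hybrid_features` yields a 420-dimensional vector per peptide (20 + 400), and `soft_vote` would receive one probability from each of the five baseline models trained on such vectors.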
Pages: 8