共 30 条
Prediction of anti-inflammatory peptides by a sequence-based stacking ensemble model named AIPStack
被引:18
|作者:
Deng, Hua
[1
]
Lou, Chaofeng
[1
]
Wu, Zengrui
[1
]
Li, Weihua
[1
]
Liu, Guixia
[1
]
Tang, Yun
[1
]
机构:
[1] East China Univ Sci & Technol, Sch Pharm, Shanghai Frontiers Sci Ctr Optogenet Tech Cell Met, Shanghai 200237, Peoples R China
来源:
基金:
中国国家自然科学基金;
关键词:
AMINO-ACID-COMPOSITION;
CLASSIFICATION;
MECHANISMS;
PROTEIN;
D O I:
10.1016/j.isci.2022.104967
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Accurate and efficient identification of anti-inflammatory peptides (AIPs) is crucial for the treatment of inflammation. Here, we proposed a two-layer stacking ensemble model, AIPStack, to effectively predict AIPs. At first, we constructed a new dataset for model building and validation. Then, peptide sequences were represented by hybrid features, which were fused by two amino acid composition descriptors. Next, the stacking ensemble model was constructed by random forest and extremely randomized tree as the base-classifiers and logistic regression as themeta-classifier to receive the outputs from the base-classifiers. AIPStack achieved an AUC of 0.819, accuracy of 0.755, and MCC of 0.510 on the independent set 3, which were higher than other AIP predictors. Furthermore, the essential sequence features were highlighted by the Shapley Additive exPlanation (SHAP) method. It is anticipated that AIPStack could be used for AIP prediction in a high-throughput manner and facilitate the hypothesis-driven experimental design.
引用
收藏
页数:20
相关论文