共 48 条
HIV-1 tropism prediction by the XGboost and HMM methods
被引:21
作者:
Chen, Xiang
[1
]
Wang, Zhi-Xin
[1
]
Pan, Xian-Ming
[2
]
机构:
[1] Tsinghua Univ, Sch Life Sci, Key Lab, Minist Educ Prot Sci, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Life Sci, Key Lab, Minist Educ Bioinformat, Beijing 100084, Peoples R China
基金:
中国国家自然科学基金;
关键词:
CORECEPTOR TROPISM;
V3;
LOOP;
CLASSIFICATION;
PHENOTYPE;
SEQUENCE;
SUBTYPE;
EPIDEMIOLOGY;
POPULATION;
MUTATIONS;
RECEPTORS;
D O I:
10.1038/s41598-019-46420-4
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Human Immunodeficiency Virus 1 (HIV-1) co-receptor usage, called tropism, is associated with disease progression towards AIDS. Furthermore, the recently developed and developing drugs against co-receptors CCR5 or CXCR4 open a new thought for HIV-1 therapy. Thus, knowledge about tropism is critical for illness diagnosis and regimen prescription. To improve tropism prediction accuracy, we developed two novel methods, the extreme gradient boosting based XGBpred and the hidden Markov model based HMMpred. Both XGBpred and HMMpred achieved higher specificities (72.56% and 72.09%) than the state-of-the-art methods Geno2pheno (61.6%) and G2p_str (68.60%) in a 10-fold cross validation test at the same sensitivity of 93.73%. Moreover, XGBpred had more outstanding performances (with AUCs 0.9483, 0.9464) than HMMpred (0.8829, 0.8774) on the Hivcopred and Newdb (created in this work) datasets containing larger proportions of hard-to-predict dual tropic samples in the X4-using tropic samples. Therefore, we recommend the use of our novel method XGBpred to predict tropism. The two methods and datasets are available via http://spg.med.tsinghua.edu.cn:23334/XGBpred/. In addition, our models identified that positions 5, 11, 13, 18, 22, 24, and 25 were correlated with HIV-1 tropism.
引用
收藏
页数:8
相关论文