HIV-1 tropism prediction by the XGboost and HMM methods

被引:21
作者
Chen, Xiang [1 ]
Wang, Zhi-Xin [1 ]
Pan, Xian-Ming [2 ]
机构
[1] Tsinghua Univ, Sch Life Sci, Key Lab, Minist Educ Prot Sci, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Life Sci, Key Lab, Minist Educ Bioinformat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
CORECEPTOR TROPISM; V3; LOOP; CLASSIFICATION; PHENOTYPE; SEQUENCE; SUBTYPE; EPIDEMIOLOGY; POPULATION; MUTATIONS; RECEPTORS;
D O I
10.1038/s41598-019-46420-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human Immunodeficiency Virus 1 (HIV-1) co-receptor usage, called tropism, is associated with disease progression towards AIDS. Furthermore, the recently developed and developing drugs against co-receptors CCR5 or CXCR4 open a new thought for HIV-1 therapy. Thus, knowledge about tropism is critical for illness diagnosis and regimen prescription. To improve tropism prediction accuracy, we developed two novel methods, the extreme gradient boosting based XGBpred and the hidden Markov model based HMMpred. Both XGBpred and HMMpred achieved higher specificities (72.56% and 72.09%) than the state-of-the-art methods Geno2pheno (61.6%) and G2p_str (68.60%) in a 10-fold cross validation test at the same sensitivity of 93.73%. Moreover, XGBpred had more outstanding performances (with AUCs 0.9483, 0.9464) than HMMpred (0.8829, 0.8774) on the Hivcopred and Newdb (created in this work) datasets containing larger proportions of hard-to-predict dual tropic samples in the X4-using tropic samples. Therefore, we recommend the use of our novel method XGBpred to predict tropism. The two methods and datasets are available via http://spg.med.tsinghua.edu.cn:23334/XGBpred/. In addition, our models identified that positions 5, 11, 13, 18, 22, 24, and 25 were correlated with HIV-1 tropism.
引用
收藏
页数:8
相关论文
共 48 条
[1]   Phenotypic Coreceptor Tropism in Perinatally HIV-infected Youth Failing Antiretroviral Therapy [J].
Agwu, Allison L. ;
Yao, Tzy-Jyun ;
Eshleman, Susan H. ;
Patel, Kunjal ;
Huang, Wei ;
Burchett, Sandra K. ;
Siberry, George K. ;
Van Dyke, Russell B. .
PEDIATRIC INFECTIOUS DISEASE JOURNAL, 2016, 35 (07) :777-781
[2]   Chemokine receptors as HIV-1 coreceptors: Roles in viral entry, tropism, and disease [J].
Berger, EA ;
Murphy, PM ;
Farber, JM .
ANNUAL REVIEW OF IMMUNOLOGY, 1999, 17 :657-700
[3]   A new classification for HIV-1 [J].
Berger, EA ;
Doms, RW ;
Fenyö, EM ;
Korber, BTM ;
Littman, DR ;
Moore, JP ;
Sattentau, QJ ;
Schuitemaker, H ;
Sodroski, J ;
Weiss, RA .
NATURE, 1998, 391 (6664) :240-240
[4]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[5]   Analysis of Physicochemical and Structural Properties Determining HIV-1 Coreceptor Usage [J].
Bozek, Katarzyna ;
Lengauer, Thomas ;
Sierra, Saleta ;
Kaiser, Rolf ;
Domingues, Francisco S. .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (03)
[6]   Molecular and clinical epidemiology of CXCR4-using HIV-1 in a large population of antiretroviral-naive individuals [J].
Brumme, ZL ;
Goodrich, J ;
Mayer, HB ;
Brumme, CJ ;
Henrick, BM ;
Wynhoven, B ;
Asselin, JJ ;
Cheung, PK ;
Hogg, RS ;
Montaner, JSG ;
Harrigan, PR .
JOURNAL OF INFECTIOUS DISEASES, 2005, 192 (03) :466-474
[7]   Full genome sequences of human immunodeficiency virus type 1 subtypes G and A/G intersubtype recombinants [J].
Carr, JK ;
Salminen, MO ;
Albert, J ;
Sanders-Buell, E ;
Gotte, D ;
Birx, DL ;
McCutchan, FE .
VIROLOGY, 1998, 247 (01) :22-31
[8]   MINIMAL REQUIREMENTS FOR THE HUMAN-IMMUNODEFICIENCY-VIRUS TYPE-1 V3 DOMAIN TO SUPPORT THE SYNCYTIUM-INDUCING PHENOTYPE - ANALYSIS BY SINGLE AMINO-ACID SUBSTITUTION [J].
DEJONG, JJ ;
DERONDE, A ;
KEULEN, W ;
TERSMETTE, M ;
GOUDSMIT, J .
JOURNAL OF VIROLOGY, 1992, 66 (11) :6777-6780
[9]   Selected amino acid mutations in HIV-1 B subtype gp41 are Associated with Specific gp120V3 signatures in the regulation of Co-Receptor usage [J].
Dimonte, Salvatore ;
Mercurio, Fabio ;
Svicher, Valentina ;
D'Arrigo, Roberta ;
Perno, Carlo-Federico ;
Ceccherini-Silberstein, Francesca .
RETROVIROLOGY, 2011, 8
[10]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763