The HIV Mutation Browser: A Resource for Human Immunodeficiency Virus Mutagenesis and Polymorphism Data

被引:20
作者
Davey, Norman E. [1 ,2 ,3 ]
Satagopam, Venkata P. [4 ]
Santiago-Mozos, Salvador [1 ]
Villacorta-Martin, Carlos [1 ]
Bharat, Tanmay A. M. [1 ]
Schneider, Reinhard [4 ]
Briggs, John A. G. [1 ,5 ]
机构
[1] EMBL, Struct & Computat Biol Unit, Heidelberg, Germany
[2] Univ Calif San Francisco, Dept Physiol, San Francisco, CA USA
[3] Univ Calif San Francisco, Dept Biochem & Biophys, San Francisco, CA 94143 USA
[4] Luxembourg Ctr Syst Biomed, Belval, Luxembourg
[5] Univ Klinikum Heidelberg, EMBL, Mol Med Partnership Unit, Heidelberg, Germany
关键词
EXTRACTION; PERFORMANCE;
D O I
10.1371/journal.pcbi.1003951
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Huge research effort has been invested over many years to determine the phenotypes of natural or artificial mutations in HIV proteins-interpretation of mutation phenotypes is an invaluable source of new knowledge. The results of this research effort are recorded in the scientific literature, but it is difficult for virologists to rapidly find it. Manually locating data on phenotypic variation within the approximately 270,000 available HIV-related research articles, or the further 1,500 articles that are published each month is a daunting task. Accordingly, the HIV research community would benefit from a resource cataloguing the available HIV mutation literature. We have applied computational text-mining techniques to parse and map mutagenesis and polymorphism information from the HIV literature, have enriched the data with ancillary information and have developed a public, web-based interface through which it can be intuitively explored: the HIV mutation browser. The current release of the HIV mutation browser describes the phenotypes of 7,608 unique mutations at 2,520 sites in the HIV proteome, resulting from the analysis of 120,899 papers. The mutation information for each protein is organised in a residue-centric manner and each residue is linked to the relevant experimental literature. The importance of HIV as a global health burden advocates extensive effort to maximise the efficiency of HIV research. The HIV mutation browser provides a valuable new resource for the research community. The HIV mutation browser is available at: http://hivmut.org.
引用
收藏
页数:8
相关论文
共 17 条
[1]   Activities at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightingale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Corbett, Matt .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D191-D198
[2]   MutationFinder: a high-performance system for extracting point mutation mentions from text [J].
Caporaso, J. Gregory ;
Baumgartner, William A., Jr. ;
Randolph, David A. ;
Cohen, K. Bretonnel ;
Hunter, Lawrence .
BIOINFORMATICS, 2007, 23 (14) :1862-1865
[3]   The eukaryotic linear motif resource ELM: 10 years and counting [J].
Dinkel, Holger ;
Van Roey, Kim ;
Michael, Sushama ;
Davey, Norman E. ;
Weatheritt, Robert J. ;
Born, Diana ;
Speck, Tobias ;
Krueger, Daniel ;
Grebnev, Gleb ;
Kuban, Marta ;
Strumillo, Marta ;
Uyar, Bora ;
Budd, Aidan ;
Altenberg, Brigitte ;
Seiler, Markus ;
Chemes, Lucia B. ;
Glavina, Juliana ;
Sanchez, Ignacio E. ;
Diella, Francesca ;
Gibson, Toby J. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D259-D266
[4]   The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins [J].
Dosztányi, Z ;
Csizmók, V ;
Tompa, P ;
Simon, I .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 347 (04) :827-839
[5]   Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature [J].
Doughty, Emily ;
Kertesz-Farkas, Attila ;
Bodenreider, Olivier ;
Thompson, Gary ;
Adadey, Asa ;
Peterson, Thomas ;
Kann, Maricel G. .
BIOINFORMATICS, 2011, 27 (03) :408-415
[6]   Regulated degradation of the HIV-1 Vpu protein through a βTrCP-Independent pathway limits the release of viral particles [J].
Estrabaud, Emilie ;
Le Rouzic, Erwann ;
Lopez-Verges, Sandra ;
Morel, Marina ;
Belaidouni, Nadia ;
Benarous, Richard ;
Transy, Catherine ;
Berlioz-Torrent, Clarisse ;
Margottin-Goguet, Florence .
PLOS PATHOGENS, 2007, 3 (07) :995-1004
[7]   The first postmodern pandemic: 25 years of HIV/AIDS [J].
Kallings, Lars O. .
JOURNAL OF INTERNAL MEDICINE, 2008, 263 (03) :218-243
[8]   MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability [J].
Katoh, Kazutaka ;
Standley, Daron M. .
MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (04) :772-780
[9]   Extraction of human kinase mutations from literature, databases and genotyping studies [J].
Krallinger, Martin ;
Izarzugaza, Jose M. G. ;
Rodriguez-Penagos, Carlos ;
Valencia, Alfonso .
BMC BIOINFORMATICS, 2009, 10
[10]   Algorithms and semantic infrastructure for mutation impact extraction and grounding [J].
Laurila, Jonas B. ;
Naderi, Nona ;
Witte, Rene ;
Riazanov, Alexandre ;
Kouznetsov, Alexandre ;
Baker, Christopher J. O. .
BMC GENOMICS, 2010, 11