Improving sequence variant descriptions in mutation Databases and literature using the mutalyzer sequence variation nomenclature checker

被引:327
作者
Wildeman, Martin [1 ]
van Ophuizen, Ernest [1 ]
den Dunnen, Johan T. [1 ]
Taschner, Peter E. M. [1 ]
机构
[1] Leiden Univ, Med Ctr, Ctr Human & Clin Genet, Dept Human Genet, NL-2300 RC Leiden, Netherlands
关键词
sequence variant description check; mutation nomenclature; locus-specific mutation database; software;
D O I
10.1002/humu.20654
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Unambiguous and correct sequence variant descriptions are of utmost importance, not in the least since mistakes and uncertainties may lead to undesired errors in clinical diagnosis. We developed the Mutation Analyzer (Mutalyzer) sequence variation nomenclature checker (www.lovd.nl/mutalyzer; last accessed 13 September 2007) for automated analysis and correction of sequence variant descriptions using reference sequences from any organism. Mutalyzer handles most variation types: substitution, deletion, duplication, insertion, indel, and splice,site changes following current recommendations of the Human Genome Variation Society (HGVS). Input is a GenBank accession number or an uploaded reference sequence file in GenBank format with user-modified annotation, an HGNC gene symbol, and the variant (single or in a batch file). Mutalyzer generates variant descriptions at DNA level, the level of all annotated transcripts and the deduced outcome at protein level. To validate Mutalyzer's performance and to investigate the sequence variant description quality in locus-specific mutation databases (LSDBs), more than 11,000 variants in the PAH, BIC BRCA2, and HbVar databases were analyzed, showing that 87%, 25%, and 38%, respectively, were error-free and following the recommendations. Low recognition rates in BIC and HbVar (38% and 51%, respectively) were due to lack of a well-annotated genomic reference sequence (HbVar) or noncompliance to the guidelines (BRCA2). Provided with well-annotated genomic reference sequences, Mutalyzer is very effective for the curation of newly discovered sequence variation descriptions and existing LSDB, data. Mutalyzer will be linked to the Leiden Open source Variation Database (LOVD) (www.LOVD.nl; last accessed 13 September 2007) and is the first module of a sequence variant effect prediction package.
引用
收藏
页码:6 / 13
页数:8
相关论文
共 14 条
[1]  
Antonarakis SE, 1998, HUM MUTAT, V11, P1
[2]   Time for a unified system of mutation description and reporting: A review of locus-specific mutation Databases [J].
Claustres, M ;
Horaitis, O ;
Vanevski, M ;
Cotton, RGH .
GENOME RESEARCH, 2002, 12 (05) :680-688
[3]   Recommendations of the 2006 Human Variome Project meeting [J].
Cotton, Richard G. H. .
NATURE GENETICS, 2007, 39 (04) :433-436
[4]   Standardizing mutation nomenclature: Why bother.? [J].
den Dunnen, JT ;
Paalman, MH .
HUMAN MUTATION, 2003, 22 (03) :181-182
[5]  
den Dunnen JT, 2000, HUM MUTAT, V15, P7
[6]   LOVD: Easy creation of a locus-specific sequence variation database using an "LSDB-in-a-Box" approach [J].
Fokkema, IFAC ;
den Dunnen, JT ;
Taschner, PEM .
HUMAN MUTATION, 2005, 26 (02) :63-68
[7]   HGVbase:: a curated resource describing human DNA variation and phenotype relationships [J].
Fredman, D ;
Munns, G ;
Rios, D ;
Sjöholm, F ;
Siegfried, M ;
Lenhard, B ;
Lehväslaiho, H ;
Brookes, AJ .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D516-D519
[8]  
Hamosh A, 2005, NUCLEIC ACIDS RES, V33, pD514
[9]   HbVar.: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server [J].
Hardison, RC ;
Chui, DHK ;
Giardine, B ;
Riemer, C ;
Patrinos, GP ;
Anagnou, N ;
Miller, W ;
Wajcman, H .
HUMAN MUTATION, 2002, 19 (03) :225-233
[10]  
Scriver CR, 2000, HUM MUTAT, V15, P99, DOI 10.1002/(SICI)1098-1004(200001)15:1<99::AID-HUMU18>3.0.CO