ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

被引:10775
作者
Wang, Kai [1 ]
Li, Mingyao [2 ]
Hakonarson, Hakon [1 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Ctr Appl Genom, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Pediat, Philadelphia, PA 19104 USA
关键词
SNPS; ASSOCIATION; GENOMES;
D O I
10.1093/nar/gkq603
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a 'variants reduction' protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires similar to 4 min to perform gene-based annotation and similar to 15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/.
引用
收藏
页数:7
相关论文
共 50 条
[31]   Sharing of photobionts in sympatric populations of Thamnolia and Cetraria lichens: evidence from high-throughput sequencing [J].
Onut-Brannstrom, Ioana ;
Benjamin, Mitchell ;
Scofield, Douglas G. ;
Heidmarsson, Starri ;
Andersson, Martin G. I. ;
Lindstrom, Eva S. ;
Johannesson, Hanna .
SCIENTIFIC REPORTS, 2018, 8
[32]   MULTISCALE POISSON PROCESS APPROACHES FOR DETECTING AND ESTIMATING DIFFERENCES FROM HIGH-THROUGHPUT SEQUENCING ASSAYS [J].
Shim, Heejung ;
Xing, Zhengrong ;
Pantaleo, Ester ;
Luca, Francesca ;
Pique-Regi, Roger ;
Stephens, Matthew .
ANNALS OF APPLIED STATISTICS, 2024, 18 (03) :1773-1788
[33]   Mapping the Burkholderia cenocepacia niche response via high-throughput sequencing [J].
Yoder-Himes, D. R. ;
Chain, P. S. G. ;
Zhu, Y. ;
Wurtzel, O. ;
Rubin, E. M. ;
Tiedje, James M. ;
Sorek, R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (10) :3976-3981
[34]   LDx: Estimation of Linkage Disequilibrium from High-Throughput Pooled Resequencing Data [J].
Feder, Alison F. ;
Petrov, Dmitri A. ;
Bergland, Alan O. .
PLOS ONE, 2012, 7 (11)
[35]   Considerations for Optimization of High-Throughput Sequencing Bioinformatics Pipelines for Virus Detection [J].
Lambert, Christophe ;
Braxton, Cassandra ;
Charlebois, Robert L. ;
Deyati, Avisek ;
Duncan, Paul ;
La Neve, Fabio ;
Malicki, Heather D. ;
Ribrioux, Sebastien ;
Rozelle, Daniel K. ;
Michaels, Brandye ;
Sun, Wenping ;
Yang, Zhihui ;
Khan, Arifa S. .
VIRUSES-BASEL, 2018, 10 (10)
[36]   HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing [J].
Weissensteiner, Hansi ;
Pacher, Dominic ;
Kloss-Brandstaetter, Anita ;
Forer, Lukas ;
Specht, Guenther ;
Bandelt, Hans-Juergen ;
Kronenberg, Florian ;
Salas, Antonio ;
Schoenherr, Sebastian .
NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) :W58-W63
[37]   Identifying micro-inversions using high-throughput sequencing reads [J].
He, Feifei ;
Li, Yang ;
Tang, Yu-Hang ;
Ma, Jian ;
Zhu, Huaiqiu .
BMC GENOMICS, 2016, 17
[38]   Single-nucleotide polymorphism discovery by high-throughput sequencing in sorghum [J].
Nelson, James C. ;
Wang, Shichen ;
Wu, Yuye ;
Li, Xianran ;
Antony, Ginny ;
White, Frank F. ;
Yu, Jianming .
BMC GENOMICS, 2011, 12
[39]   High-throughput single-cell sequencing of activated sludge microbiome [J].
Zhang, Yulin ;
Xue, Bingjie ;
Mao, Yanping ;
Chen, Xi ;
Yan, Weifu ;
Wang, Yanren ;
Wang, Yulin ;
Liu, Lei ;
Yu, Jiale ;
Zhang, Xiaojin ;
Chao, Shan ;
Topp, Edward ;
Zheng, Wenshan ;
Zhang, Tong .
ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY, 2025, 23
[40]   The CNVrd2 package: measurement of copy number at complex loci using high-throughput sequencing data [J].
Nguyen, Hoang T. ;
Merriman, Tony R. ;
Black, Michael A. .
FRONTIERS IN GENETICS, 2014, 5