SNPdat: Easy and rapid annotation of results from de novo SNP discovery projects for model and non-model organisms

Cited by: 39
Authors
Doran, Anthony G. [1, 2]
Creevey, Christopher J. [1]
Affiliations
[1] TEAGASC, Teagasc Anim & Biosci Res Dept, Anim & Grassland Res & Innovat Ctr, Dunsany, Meath, Ireland
[2] NUI Maynooth, Dept Biol, Mol Evolut & Bioinformat Unit, Maynooth, Kildare, Ireland
Source
BMC Bioinformatics | 2013 / Vol. 14
Funding
Science Foundation Ireland
Keywords
SNPs; Annotation; Software; Non-model organisms; Nucleotide polymorphism; Functional annotation; Web database; Genome
DOI
10.1186/1471-2105-14-45
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Single nucleotide polymorphisms (SNPs) are the most abundant genetic variant found in vertebrates and invertebrates. SNP discovery has become a highly automated, robust and relatively inexpensive process allowing the identification of many thousands of mutations for model and non-model organisms. Annotating large numbers of SNPs can be a difficult and complex process. Many tools available are optimised for use with organisms densely sampled for SNPs, such as humans. There are currently few tools available that are species non-specific or support non-model organism data.

Results: Here we present SNPdat, a high-throughput analysis tool that can provide a comprehensive annotation of both novel and known SNPs for any organism with a draft sequence and annotation. Using a dataset of 4,566 SNPs identified in cattle using high-throughput DNA sequencing we demonstrate the annotations performed and the statistics that can be generated by SNPdat.

Conclusions: SNPdat provides users with a simple tool for annotation of genomes that are either not supported by other tools or have a small number of annotated SNPs available. SNPdat can also be used to analyse datasets from organisms which are densely sampled for SNPs. As a command line tool it can easily be incorporated into existing SNP discovery pipelines and fills a niche for analyses involving non-model organisms that are not supported by many available SNP annotation tools. SNPdat will be of great interest to scientists involved in SNP discovery and analysis projects, particularly those with limited bioinformatics experience.
Pages: 6
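The abstract describes annotating SNPs for any organism that has a draft sequence and an annotation. As a rough illustration of the core idea only, the sketch below maps each SNP onto annotated features by coordinate overlap and falls back to the nearest feature on the same sequence. It is not SNPdat's implementation: the `Feature` and `Snp` types, the `annotate` function, the 1-based inclusive coordinates, and the choice to report a nearest feature are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Feature:
    """Hypothetical annotated feature, e.g. one row of a gene annotation file."""
    chrom: str
    start: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive
    kind: str   # e.g. "exon" or "CDS"
    gene_id: str


@dataclass(frozen=True)
class Snp:
    """Hypothetical SNP record: sequence name, position, observed allele."""
    chrom: str
    pos: int
    allele: str


def annotate(snp: Snp, features: List[Feature]) -> List[Tuple[str, str, int]]:
    """Return (gene_id, label, distance) tuples for one SNP.

    A SNP inside a feature gets that feature's kind and distance 0.
    Otherwise the nearest feature on the same sequence is reported
    with the label "intergenic" and the distance in bases.
    """
    same_chrom = [f for f in features if f.chrom == snp.chrom]
    hits = [f for f in same_chrom if f.start <= snp.pos <= f.end]
    if hits:
        return [(f.gene_id, f.kind, 0) for f in hits]
    if not same_chrom:
        return []  # nothing annotated on this sequence

    def distance(f: Feature) -> int:
        # The SNP lies outside f, so it is either before start or after end.
        return f.start - snp.pos if snp.pos < f.start else snp.pos - f.end

    nearest = min(same_chrom, key=distance)
    return [(nearest.gene_id, "intergenic", distance(nearest))]


if __name__ == "__main__":
    # Made-up coordinates, for illustration only.
    demo_features = [
        Feature("chr1", 100, 200, "exon", "geneA"),
        Feature("chr1", 500, 600, "exon", "geneB"),
    ]
    for snp in (Snp("chr1", 150, "A"), Snp("chr1", 450, "G")):
        print(snp, annotate(snp, demo_features))
```

The linear scan keeps the sketch short. For many thousands of SNPs, sorting features by start position or using an interval tree would be the more usual choice.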