Human variation database: an open-source database template for genomic discovery

被引:9
作者
Fejes, Anthony P. [1 ]
Khodabakhshi, Alireza Hadj [1 ]
Birol, Inanc [1 ]
Jones, Steven J. M. [1 ]
机构
[1] BC Canc Agcy, Genome Sci Ctr, Vancouver, BC V5Z 4S6, Canada
关键词
D O I
10.1093/bioinformatics/btr100
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Current public variation databases are based upon collaboratively pooling data into a single database with a single interface available to the public. This gives little control to the collaborator to mine the database and requires that they freely share their data with the owners of the repository. We aim to provide an alternative mechanism: providing the source code and application programming interface (API) of a database, enabling researchers to set up local versions without investing heavily in the development of the resource and allowing for confidential information to remain secure. Results: We describe an open-source database that can be installed easily at any research facility for the storage and analysis of thousands of next-generation sequencing variations. This database is built using PostgreSQL 8.4 (The PostgreSQL Global Development Group. postgres 8.4: http://www.postgresql.org) and provides a novel method for collating and searching across the reported results from thousands of next-generation sequence samples, as well as rapidly accessing vital information on the origin of the samples. The schema of the database makes rapid and insightful queries simple and enables easy annotation of novel or known genetic variations. A modular and cross-platform Java API is provided to perform common functions, such as generation of standard experimental reports and graphical summaries of modifications to genes. Included libraries allow adopters of the database to quickly develop their own queries.
引用
收藏
页码:1155 / 1156
页数:2
相关论文
共 5 条
[1]   GWAS Analyzer: integrating genotype, phenotype and public annotation data for genome-wide association study analysis [J].
Fong, Christine ;
Ko, Dennis C. ;
Wasnick, Michael ;
Radey, Matthew ;
Miller, Samuel I. ;
Brittnacher, Mitchell .
BIOINFORMATICS, 2010, 26 (04) :560-564
[2]   Ensembl 2007 [J].
Hubbard, T. J. P. ;
Aken, B. L. ;
Beal, K. ;
Ballester, B. ;
Caccamo, M. ;
Chen, Y. ;
Clarke, L. ;
Coates, G. ;
Cunningham, F. ;
Cutts, T. ;
Down, T. ;
Dyer, S. C. ;
Fitzgerald, S. ;
Fernandez-Banet, J. ;
Graf, S. ;
Haider, S. ;
Hammond, M. ;
Herrero, J. ;
Holland, R. ;
Howe, K. ;
Howe, K. ;
Johnson, N. ;
Kahari, A. ;
Keefe, D. ;
Kokocinski, F. ;
Kulesha, E. ;
Lawson, D. ;
Longden, I. ;
Melsopp, C. ;
Megy, K. ;
Meidl, P. ;
Overduin, B. ;
Parker, A. ;
Prlic, A. ;
Rice, S. ;
Rios, D. ;
Schuster, M. ;
Sealy, I. ;
Severin, J. ;
Slater, G. ;
Smedley, D. ;
Spudich, G. ;
Trevanion, S. ;
Vilella, A. ;
Vogel, J. ;
White, S. ;
Wood, M. ;
Cox, T. ;
Curwen, V. ;
Durbin, R. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D610-D617
[3]   The Sequence Alignment/Map format and SAMtools [J].
Li, Heng ;
Handsaker, Bob ;
Wysoker, Alec ;
Fennell, Tim ;
Ruan, Jue ;
Homer, Nils ;
Marth, Gabor ;
Abecasis, Goncalo ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (16) :2078-2079
[4]   Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution [J].
Shah, Sohrab P. ;
Morin, Ryan D. ;
Khattra, Jaswinder ;
Prentice, Leah ;
Pugh, Trevor ;
Burleigh, Angela ;
Delaney, Allen ;
Gelmon, Karen ;
Guliany, Ryan ;
Senz, Janine ;
Steidl, Christian ;
Holt, Robert A. ;
Jones, Steven ;
Sun, Mark ;
Leung, Gillian ;
Moore, Richard ;
Severson, Tesa ;
Taylor, Greg A. ;
Teschendorff, Andrew E. ;
Tse, Kane ;
Turashvili, Gulisa ;
Varhol, Richard ;
Warren, Rene L. ;
Watson, Peter ;
Zhao, Yongjun ;
Caldas, Carlos ;
Huntsman, David ;
Hirst, Martin ;
Marra, Marco A. ;
Aparicio, Samuel .
NATURE, 2009, 461 (7265) :809-U67
[5]   dbSNP: the NCBI database of genetic variation [J].
Sherry, ST ;
Ward, MH ;
Kholodov, M ;
Baker, J ;
Phan, L ;
Smigielski, EM ;
Sirotkin, K .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :308-311