SNP development from RNA-seq data in a nonmodel fish: how many individuals are needed for accurate allele frequency prediction?

被引:39
作者
Schunter, C. [1 ,2 ]
Garza, J. C. [3 ]
Macpherson, E. [1 ]
Pascual, M. [2 ]
机构
[1] CSIC, CEAB, Blanes 17300, Spain
[2] Univ Barcelona, Dept Genet, E-08028 Barcelona, Spain
[3] Natl Marine Fisheries Serv, Southwest Fisheries Sci Ctr, Santa Cruz, CA 95060 USA
关键词
minor allele frequency; non-model species; RNA-seq; SNP development; Tripterygion delaisi; SINGLE-NUCLEOTIDE POLYMORPHISMS; SEQUENCE; PARENTAGE; PISCES; EVOLUTION; DISCOVERY; GENOMICS; MARKERS; TOOL;
D O I
10.1111/1755-0998.12155
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single nucleotide polymorphisms (SNPs) are rapidly becoming the marker of choice in population genetics due to a variety of advantages relative to other markers, including higher genomic density, data quality, reproducibility and genotyping efficiency, as well as ease of portability between laboratories. Advances in sequencing technology and methodologies to reduce genomic representation have made the isolation of SNPs feasible for nonmodel organisms. RNA-seq is one such technique for the discovery of SNPs and development of markers for large-scale genotyping. Here, we report the development of 192 validated SNP markers for parentage analysis in Tripterygion delaisi (the black-faced blenny), a small rocky-shore fish from the Mediterranean Sea. RNA-seq data for 15 individual samples were used for SNP discovery by applying a series of selection criteria. Genotypes were then collected from 1599 individuals from the same population with the resulting loci. Differences in heterozygosity and allele frequencies were found between the two data sets. Heterozygosity was lower, on average, in the population sample, and the mean difference between the frequencies of particular alleles in the two data sets was 0.135 +/- 0.100. We used bootstrap resampling of the sequence data to predict appropriate sample sizes for SNP discovery. As cDNA library production is time-consuming and expensive, we suggest that using seven individuals for RNA sequencing reduces the probability of discarding highly informative SNP loci, due to lack of observed polymorphism, whereas use of more than 12 samples does not considerably improve prediction of true allele frequencies.
引用
收藏
页码:157 / 165
页数:9
相关论文
共 38 条
[1]   An SNP map of the human genome generated by reduced representation shotgun sequencing [J].
Altshuler, D ;
Pollara, VJ ;
Cowles, CR ;
Van Etten, WJ ;
Baldwin, J ;
Linton, L ;
Lander, ES .
NATURE, 2000, 407 (6803) :513-516
[2]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[3]   The power of single-nucleotide polymorphisms for large-scale parentage inference [J].
Anderson, EC ;
Garza, JC .
GENETICS, 2006, 172 (04) :2567-2582
[4]   Genomic basis for coral resilience to climate change [J].
Barshis, Daniel J. ;
Ladner, Jason T. ;
Oliver, Thomas A. ;
Seneca, Francois O. ;
Traylor-Knowles, Nikki ;
Palumbi, Stephen R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (04) :1387-1392
[5]   Estimation of the number of SNP genetic markers required for parentage verification [J].
Baruch, E. ;
Weller, J. I. .
ANIMAL GENETICS, 2008, 39 (05) :474-479
[6]   The utility of single nucleotide polymorphisms in inferences of population history [J].
Brumfield, RT ;
Beerli, P ;
Nickerson, DA ;
Edwards, SV .
TRENDS IN ECOLOGY & EVOLUTION, 2003, 18 (05) :249-256
[7]   Population structure within and between subspecies of the Mediterranean triplefin fish Tripterygion delaisi revealed by highly polymorphic microsatellite loci [J].
Carreras-Carbonell, J. ;
Macpherson, E. ;
Pascual, M. .
MOLECULAR ECOLOGY, 2006, 15 (12) :3527-3539
[8]   Rapid radiation and cryptic speciation in Mediterranean triplefin blennies (Pisces:Tripterygiidae) combining multiple genes [J].
Carreras-Carbonell, J ;
Macpherson, E ;
Pascual, M .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2005, 37 (03) :751-761
[9]   Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research [J].
Conesa, A ;
Götz, S ;
García-Gómez, JM ;
Terol, J ;
Talón, M ;
Robles, M .
BIOINFORMATICS, 2005, 21 (18) :3674-3676
[10]   Transcriptome-wide polymorphisms of red abalone (Haliotis rufescens) reveal patterns of gene flow and local adaptation [J].
De Wit, Pierre ;
Palumbi, Stephen R. .
MOLECULAR ECOLOGY, 2013, 22 (11) :2884-2897