Development and characterisation of an expressed sequence tags (EST)-derived single nucleotide polymorphisms (SNPs) resource in rainbow trout

被引:16
作者
Boussaha, Mekki [1 ]
Guyomard, Rene [1 ]
Cabau, Cedric [2 ]
Esquerre, Diane [3 ]
Quillet, Edwige [1 ]
机构
[1] INRA, UMR Genet Anim & Biol Integrat 1313, F-78350 Jouy En Josas, France
[2] INRA, SIGENAE Rech Avicoles UR83, F-37380 Nouzilly, France
[3] INRA, UMR 444, Lab Genet Cellulaire Plateforme GET, Castanet Tolosan, Tolosan, France
关键词
SALMON SALMO-SALAR; ATLANTIC SALMON; LINKAGE MAP; GENOME; LOCI; IDENTIFICATION; DISCOVERY; INFERENCE; SOFTWARE; QTL;
D O I
10.1186/1471-2164-13-238
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: There is considerable interest in developing high-throughput genotyping with single nucleotide polymorphisms (SNPs) for the identification of genes affecting important ecological or economical traits. SNPs are evenly distributed throughout the genome and are likely to be functionally relevant. In rainbow trout, in silico screening of EST databases represents an attractive approach for de novo SNP identification. Nevertheless, EST sequencing errors and assembly of EST paralogous sequences can lead to the identification of false positive SNPs which renders the reliability of EST-derived SNPs relatively low. Further validation of EST-derived SNPs is therefore required. The objective of this work was to assess the quality of and to validate a large number of rainbow trout EST-derived SNPs. Results: A panel of 1,152 EST-derived SNPs was selected from the INRA Sigenae SNP database and was genotyped in standard and double haploid individuals from several populations using the Illumina GoldenGate BeadXpress assay. High-quality genotyping data were obtained for 958 SNPs representing a genotyping success rate of 83.2 %, out of which, 350 SNPs (36.5 %) were polymorphic in at least one population and were designated as true SNPs. They also proved to be a potential tool to investigate genetic diversity of the species, as the set of SNP successfully sorted individuals into three main groups using STRUCTURE software. Functional annotations revealed 28 non-synonymous SNPs, out of which four substitutions were predicted to affect protein functions. A subset of 223 true SNPs were polymorphic in the two INRA mapping reference families and were integrated into the INRA microsatellite-based linkage map. Conclusions: Our results represent the first study of EST-derived SNPs validation in rainbow trout, a species whose genome sequences is not yet available. We designed several specific filters in order to improve the genotyping yield. Nevertheless, our selection criteria should be further improved in order to reduce the observed high rate of false positive SNPs which results from the occurrence of whole genome duplications.
引用
收藏
页数:11
相关论文
共 45 条
[1]  
Allendorf FW, 1984, EVOLUTIONARY GENETIC
[2]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[3]   Distribution of ancestral proto-Actinopterygian chromosome arms within the genomes of 4R-derivative salmonid fishes (Rainbow trout and Atlantic salmon) [J].
Danzmann, Roy G. ;
Davidson, Evelyn A. ;
Ferguson, Moira M. ;
Gharbi, Karim ;
Koop, Ben F. ;
Hoyheim, Bjorn ;
Lien, Sigbjorn ;
Lubieniecki, Krzysztof P. ;
Moghadam, Hooman K. ;
Park, Jay ;
Phillips, Ruth B. ;
Davidson, William S. .
BMC GENOMICS, 2008, 9 (1)
[4]   CARHTAGene:: multipopulation integrated genetic and radiation hybrid mapping [J].
de Givry, S ;
Bouchez, M ;
Chabrier, P ;
Milan, D ;
Schiex, T .
BIOINFORMATICS, 2005, 21 (08) :1703-1704
[5]   MUTAGENIC DEAMINATION OF CYTOSINE RESIDUES IN DNA [J].
DUNCAN, BK ;
MILLER, JH .
NATURE, 1980, 287 (5782) :560-561
[6]   Homoplasy and mutation model at microsatellite loci and their consequences for population genetics analysis [J].
Estoup, A ;
Jarne, P ;
Cornuet, JM .
MOLECULAR ECOLOGY, 2002, 11 (09) :1591-1604
[7]   Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study [J].
Evanno, G ;
Regnaut, S ;
Goudet, J .
MOLECULAR ECOLOGY, 2005, 14 (08) :2611-2620
[8]  
Falush D, 2003, GENETICS, V164, P1567
[9]   Highly parallel SNP genotyping [J].
Fan, JB ;
Oliphant, A ;
Shen, R ;
Kermani, BG ;
Garcia, F ;
Gunderson, KL ;
Hansen, M ;
Steemers, F ;
Butler, SL ;
Deloukas, P ;
Galver, L ;
Hunt, S ;
McBride, C ;
Bibikova, M ;
Rubano, T ;
Chen, J ;
Wickham, E ;
Doucet, D ;
Chang, W ;
Campbell, D ;
Zhang, B ;
Kruglyak, S ;
Bentley, D ;
Haas, J ;
Rigault, P ;
Zhou, L ;
Stuelpnagel, J ;
Chee, MS .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 2003, 68 :69-78
[10]   Identification of candidate coding region single nucleotide polymorphisms in 165 human genes using assembled expressed sequence tags [J].
Garg, K ;
Green, P ;
Nickerson, DA .
GENOME RESEARCH, 1999, 9 (11) :1087-1092