A long reads-based de-novo assembly of the genome of the Arlee homozygous line reveals chromosomal rearrangements in rainbow trout

Cited by: 50
Authors
Gao, Guangtu [1]
Magadan, Susana [2]
Waldbieser, Geoffrey C. [3]
Youngblood, Ramey C. [4]
Wheeler, Paul A. [5, 6]
Scheffler, Brian E. [7]
Thorgaard, Gary H. [5, 6]
Palti, Yniv [1]
Affiliations
[1] USDA ARS, Natl Ctr Cool & Cold Water Aquaculture, 11861 Leetown Rd, Kearneysville, WV 25430 USA
[2] Univ Vigo, Ctr Invest Biomed, Campus Univ Lagoas Marcosende, Vigo 36310, Spain
[3] USDA ARS, Warmwater Aquaculture Res Unit, Stoneville, MS 38776 USA
[4] Mississippi State Univ, Inst Genom Biocomp & Biotechnol, Starkville, MS 39762 USA
[5] Washington State Univ, Sch Biol Sci, Pullman, WA 99164 USA
[6] Washington State Univ, Ctr Reprod Biol, Pullman, WA 99164 USA
[7] USDA ARS, Genom & Bioinformat Res Unit, Stoneville, MS 38776 USA
Funding
National Institute of Food and Agriculture (USA)
Keywords
reference genome; rainbow trout; pan-genome; structural variance; Arlee; de-novo assembly; IGH; HEAVY-CHAIN; PROVIDES; DNA; DUPLICATION; INSIGHTS; GENE
DOI
10.1093/g3journal/jkab052
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
There is still a need to improve the contiguity of the rainbow trout reference genome and to use multiple genetic backgrounds that represent the genetic diversity of this species. The Arlee doubled haploid line originated from a domesticated hatchery strain that was originally collected from the northern California coast. The Canu pipeline was used to generate a de-novo assembly of the Arlee line genome from high-coverage PacBio long-read sequence data. The assembly was further improved with Bionano optical maps and Hi-C proximity ligation sequence data to generate 32 major scaffolds corresponding to the karyotype of the Arlee line (2N = 64). The assembly is composed of 938 scaffolds with an N50 of 39.16 Mb and a total length of 2.33 Gb, of which ~95% is in 32 chromosome sequences with only 438 gaps between contigs and scaffolds. In rainbow trout the haploid chromosome number can vary from 29 to 32. In the Arlee karyotype the haploid chromosome number is 32 because chromosomes Omy04, Omy14 and Omy25 are divided into six acrocentric chromosomes. Additional structural variations identified in the Arlee genome include the major inversions on chromosomes Omy05 and Omy20 and 15 additional smaller inversions that will require further validation. This is also the first rainbow trout genome assembly that includes a scaffold with the sex-determination gene (sdY) in the chromosome Y sequence. The utility of this genome assembly is shown through the improved annotation of the duplicated genome loci that harbor the IGH genes on chromosomes Omy12 and Omy13.
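The abstract reports contiguity as a scaffold N50. As a reading aid, here is a minimal sketch of how that statistic is conventionally computed: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name `n50` and the toy lengths are assumptions made for illustration; they are not the Arlee data and not the authors' code.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sort lengths from longest to shortest and accumulate them until the
    running total reaches at least half of the overall total; the length
    that crosses that threshold is the N50.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths in Mb (illustrative only, not Arlee data).
toy_scaffolds_mb = [40, 30, 20, 5, 3, 2]
print(n50(toy_scaffolds_mb))  # prints 30
```

In the toy example the total is 100 Mb, and the two longest scaffolds (40 + 30 = 70 Mb) are the first to cover half of it, so the N50 is 30 Mb.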
Pages: 11