The Rhododendron Genome and Chromosomal Organization Provide Insight into SharedWhole-Genome Duplications across the Heath Family (Ericaceae)

被引:54
作者
Soza, Valerie L. [1 ]
Lindsley, Dale [1 ]
Waalkes, Adam [1 ,5 ]
Ramage, Elizabeth [1 ]
Patwardhan, Rupali P. [2 ]
Burton, Joshua N. [2 ,6 ]
Adey, Andrew [2 ,7 ]
Kumar, Akash [2 ,8 ]
Qiu, Ruolan [2 ]
Shendure, Jay [2 ,3 ,4 ]
Hall, Benjamin [1 ]
机构
[1] Univ Washington, Dept Biol, Seattle, WA 98195 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[3] Brotman Baty Inst Precis Med, Seattle, WA USA
[4] Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USA
[5] Univ Washington, Dept Lab Med, Seattle, WA 98195 USA
[6] Adapt Biotechnol, Seattle, WA USA
[7] Oregon Hlth & Sci Univ, Dept Mol & Med Genet, Portland, OR 97201 USA
[8] Stanford Univ, Dept Pediat, Palo Alto, CA 94304 USA
基金
美国国家科学基金会;
关键词
chromatin conformation capture (Hi-C); chromosome-scale scaffolding; de novo genome assembly; linkage map; restriction-site associated DNA (RAD) sequencing; synteny; DE-NOVO IDENTIFICATION; PHYLOGENETIC ANALYSIS; TOOL; ANNOTATION; EVOLUTION; GENE; DATABASE; PATTERNS; INFORMATION; COMPLEXITY;
D O I
10.1093/gbe/evz245
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The genus Rhododendron (Ericaceae), which includes horticulturally important plants such as azaleas, is a highly diverse and widely distributed genus of >1,000 species. Here, we report the chromosome-scale de novo assembly and genome annotation of Rhododendron williamsianum as a basis for continued study of this large genus. We created multiple short fragment genomic libraries, which were assembled using ALLPATHS-LG. This was followed by contiguity preserving transposase sequencing (CPT-seq) and fragScaff scaffolding of a large fragment library, which improved the assembly by decreasing the number of scaffolds and increasing scaffold length. Chromosome-scale scaffolding was performed by proximity-guided assembly (LACHESIS) using chromatin conformation capture (Hi-C) data. Chromosome-scale scaffolding was further refined and linkage groups defined by restriction-site associated DNA(RAD) sequencing of the parents and progeny of a genetic cross. The resulting linkage map confirmed the LACHESIS clustering and ordering of scaffolds onto chromosomes and rectified large-scale inversions. Assessments of the R. williamsianum genome assembly and gene annotation estimate them to be 89% and 79% complete, respectively. Predicted coding sequences from genome annotation were used in syntenic analyses and for generating age distributions of synonymous substitutions/site between paralgous gene pairs, which identified whole-genome duplications (WGDs) in R. williamsianum. We then analyzed other publicly available Ericaceae genomes for shared WGDs. Based on our spatial and temporal analyses of paralogous gene pairs, we find evidence for two shared, ancient WGDs in Rhododendron and Vaccinium (cranberry/blueberry) members that predate the Ericaceae family and, in one case, the Ericales order.
引用
收藏
页码:3353 / 3371
页数:19
相关论文
共 111 条
[1]   In vitro, long-range sequence information for de novo genome assembly via transposase contiguity [J].
Adey, Andrew ;
Kitzman, Jacob O. ;
Burton, Joshua N. ;
Daza, Riza ;
Kumar, Akash ;
Christiansen, Lena ;
Ronaghi, Mostafa ;
Amini, Sasan ;
Gunderson, Kevin L. ;
Steemers, Frank J. ;
Shendure, Jay .
GENOME RESEARCH, 2014, 24 (12) :2041-2049
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]   Haplotype-resolved whole-genome sequencing by contiguity-preserving transposition and combinatorial indexing [J].
Amini, Sasan ;
Pushkarev, Dmitry ;
Christiansen, Lena ;
Kostem, Emrah ;
Royce, Tom ;
Turk, Casey ;
Pignatelli, Natasha ;
Adey, Andrew ;
Kitzman, Jacob O. ;
Vijayan, Kandaswamy ;
Ronaghi, Mostafa ;
Shendure, Jay ;
Gunderson, Kevin L. ;
Steemers, Frank J. .
NATURE GENETICS, 2014, 46 (12) :1343-1349
[5]  
[Anonymous], 1982, SUBG HYMEN NOTES R B
[6]  
[Anonymous], 2000, GRAPH CLUSTERING FLO
[7]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[8]   Automated de novo identification of repeat sequence families in sequenced genomes [J].
Bao, ZR ;
Eddy, SR .
GENOME RESEARCH, 2002, 12 (08) :1269-1276
[9]   Multiple Paleopolyploidizations during the Evolution of the Compositae Reveal Parallel Patterns of Duplicate Gene Retention after Millions of Years [J].
Barker, Michael S. ;
Kane, Nolan C. ;
Matvienko, Marta ;
Kozik, Alexander ;
Michelmore, W. ;
Knapp, Steven J. ;
Rieseberg, Loren H. .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (11) :2445-2455
[10]   UniProt: the universal protein knowledgebase [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Bye-A-Jee, Hema ;
Cowley, Andrew ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Castro, Leyla Garcia ;
Figueira, Luis ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzalez, Daniel ;
Hatton-Ellis, Emma ;
Li, Weizhong ;
Liu, Wudong ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Nightingale, Andrew ;
Palka, Barbara ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Volynkin, Vladimir ;
Wardell, Tony ;
Warner, Kate ;
Watkins, Xavier ;
Zaru, Rossana ;
Zellner, Hermann ;
Xenarios, Ioannis .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D158-D169