A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)

被引:176
作者
Li, Qing [1 ]
Li, Hongbo [1 ]
Huang, Wu [1 ,2 ]
Xu, Yuanchao [1 ]
Zhou, Qian [1 ,2 ]
Wang, Shenhao [3 ]
Ruan, Jue [2 ]
Huang, Sanwen [2 ]
Zhang, Zhonghua [1 ]
机构
[1] Chinese Acad Agr Sci, Inst Vegetables & Flowers, 12 Haidian Dist, Beijing 100081, Peoples R China
[2] Chinese Acad Agr Sci, Agr Genom Inst Shenzhen, 7 Pengfei Rd, Shenzhen 518124, Peoples R China
[3] Northwest A&F Univ, Coll Hort, Yangling 712100, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
cucumber; PacBio; Hi-C; genomics; chromosome-scale assembly; OPEN SOFTWARE; HI-C; IDENTIFICATION; CONFORMATION; SEQUENCE; CAPTURE; TOOL; MAP; DOMESTICATION; TRANSCRIPTOME;
D O I
10.1093/gigascience/giz072
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. Findings: We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (similar to 211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. Conclusion: This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics.
引用
收藏
页数:10
相关论文
共 49 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Hi-C: A comprehensive technique to capture the conformation of genomes [J].
Belton, Jon-Matthew ;
McCord, Rachel Patton ;
Gibcus, Johan Harmen ;
Naumova, Natalia ;
Zhan, Ye ;
Dekker, Job .
METHODS, 2012, 58 (03) :268-276
[3]   Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome [J].
Bickhart, Derek M. ;
Rosen, Benjamin D. ;
Koren, Sergey ;
Sayre, Brian L. ;
Hastie, Alex R. ;
Chan, Saki ;
Lee, Joyce ;
Lam, Ernest T. ;
Liachko, Ivan ;
Sullivan, Shawn T. ;
Burton, Joshua N. ;
Huson, Heather J. ;
Nystrom, John C. ;
Kelley, Christy M. ;
Hutchison, Jana L. ;
Zhou, Yang ;
Sun, Jiajie ;
Crisa, Alessandra ;
de Leon, F. Abel Ponce ;
Schwartz, John C. ;
Hammond, John A. ;
Waldbieser, Geoffrey C. ;
Schroeder, Steven G. ;
Liu, George E. ;
Dunham, Maitreya J. ;
Shendure, Jay ;
Sonstegard, Tad S. ;
Phillippy, Adam M. ;
Van Tassell, Curtis P. ;
Smith, Timothy P. L. .
NATURE GENETICS, 2017, 49 (04) :643-+
[4]   ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers [J].
Coombe, Lauren ;
Zhang, Jessica ;
Vandervalk, Benjamin P. ;
Chu, Justin ;
Jackman, Shaun D. ;
Birol, Inanc ;
Warren, Rene L. .
BMC BIOINFORMATICS, 2018, 19
[5]   High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development [J].
Daccord, Nicolas ;
Celton, Jean-Marc ;
Linsmith, Gareth ;
Becker, Claude ;
Choisne, Nathalie ;
Schijlen, Elio ;
van de Geest, Henri ;
Bianco, Luca ;
Micheletti, Diego ;
Velasco, Riccardo ;
Di Pierro, Erica Adele ;
Gouzy, Jerome ;
Rees, D. Jasper G. ;
Guerif, Philippe ;
Muranty, Helene ;
Durel, Charles-Eric ;
Laurens, Francois ;
Lespinasse, Yves ;
Gaillard, Sylvain ;
Aubourg, Sebastien ;
Quesneville, Hadi ;
Weigel, Detlef ;
van de Weg, Eric ;
Troggio, Michela ;
Bucher, Etienne .
NATURE GENETICS, 2017, 49 (07) :1099-+
[6]   Sequencing and de novo assembly of a near complete indica rice genome [J].
Du, Huilong ;
Yu, Ying ;
Ma, Yanfei ;
Gao, Qiang ;
Cao, Yinghao ;
Chen, Zhuo ;
Ma, Bin ;
Qi, Ming ;
Li, Yan ;
Zhao, Xianfeng ;
Wang, Jing ;
Liu, Kunfan ;
Qin, Peng ;
Yang, Xin ;
Zhu, Lihuang ;
Li, Shigui ;
Liang, Chengzhi .
NATURE COMMUNICATIONS, 2017, 8
[7]   De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds [J].
Dudchenko, Olga ;
Batra, Sanjit S. ;
Omer, Arina D. ;
Nyquist, Sarah K. ;
Hoeger, Marie ;
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Lander, Eric S. ;
Aiden, Aviva Presser ;
Aiden, Erez Lieberman .
SCIENCE, 2017, 356 (6333) :92-95
[8]   MUSCLE: a multiple sequence alignment method with reduced time and space complexity [J].
Edgar, RC .
BMC BIOINFORMATICS, 2004, 5 (1) :1-19
[9]   INSERTION AND AMPLIFICATION OF A DNA-SEQUENCE IN SATELLITE DNA OF CUCUMIS-SATIVUS L (CUCUMBER) [J].
GANAL, M ;
HEMLEBEN, V .
THEORETICAL AND APPLIED GENETICS, 1988, 75 (02) :357-361
[10]   ORGANIZATION AND SEQUENCE-ANALYSIS OF 2 RELATED SATELLITE DNAS IN CUCUMBER (CUCUMIS-SATIVUS L) [J].
GANAL, M ;
RIEDE, I ;
HEMLEBEN, V .
JOURNAL OF MOLECULAR EVOLUTION, 1986, 23 (01) :23-30