De novo assembly of a Chinese soybean genome

被引:125
作者
Shen, Yanting [1 ,5 ]
Liu, Jing [2 ]
Geng, Haiying [3 ]
Zhang, Jixiang [1 ,5 ]
Liu, Yucheng [1 ,5 ]
Zhang, Haikuan [4 ]
Xing, Shilai [4 ]
Du, Jianchang [2 ]
Ma, Shisong [3 ]
Tian, Zhixi [1 ,5 ]
机构
[1] Chinese Acad Sci, Inst Genet & Dev Biol, State Key Lab Plant Cell & Chromosome Engn, Beijing 100101, Peoples R China
[2] Jiangsu Acad Agr Sci, Inst Crop Germplasm & Biotechnol, Prov Key Lab Agrobiol, Nanjing 210014, Jiangsu, Peoples R China
[3] Univ Sci & Technol China, Sch Life Sci, Hefei 230027, Anhui, Peoples R China
[4] Berry Genom Corp, Beijing 100015, Peoples R China
[5] Univ Chinese Acad Sci, Beijing 100039, Peoples R China
基金
中国国家自然科学基金;
关键词
de novo soybean genome; Zhonghuang; 13; Gmax_ZH13; structure variation; gene co-expression; QUANTITATIVE TRAIT LOCI; MAX L.-MERR; RNA-SEQ DATA; GLYCINE-MAX; FLOWERING TIME; AGRONOMIC TRAITS; TRANSPOSABLE ELEMENTS; GENETIC DIVERSITY; MATURITY LOCUS; QTL ANALYSIS;
D O I
10.1007/s11427-018-9360-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Soybean was domesticated in China and has become one of the most important oilseed crops. Due to bottlenecks in their introduction and dissemination, soybeans from different geographic areas exhibit extensive genetic diversity. Asia is the largest soybean market; therefore, a high-quality soybean reference genome from this area is critical for soybean research and breeding. Here, we report the de novo assembly and sequence analysis of a Chinese soybean genome for "Zhonghuang 13" by a combination of SMRT, Hi-C and optical mapping data. The assembled genome size is 1.025 Gb with a contig N50 of 3.46 Mb and a scaffold N50 of 51.87 Mb. Comparisons between this genome and the previously reported reference genome (cv. Williams 82) uncovered more than 250,000 structure variations. A total of 52,051 protein coding genes and 36,429 transposable elements were annotated for this genome, and a gene co-expression network including 39,967 genes was also established. This high quality Chinese soybean genome and its sequence analysis will provide valuable information for soybean improvement in the future.
引用
收藏
页码:871 / 884
页数:14
相关论文
共 111 条
[1]   HiCPlotter integrates genomic data with interaction matrices [J].
Akdemir, Kadir Caner ;
Chin, Lynda .
GENOME BIOLOGY, 2015, 16
[2]  
[Anonymous], 2004, SOYBEANS IMPROVEMENT
[3]   The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution [J].
Badouin, Helene ;
Gouzy, Jerome ;
Grassa, Christopher J. ;
Murat, Florent ;
Staton, S. Evan ;
Cottret, Ludovic ;
Lelandais-Briere, Christine ;
Owens, Gregory L. ;
Carrere, Sebastien ;
Mayjonade, Baptiste ;
Legrand, Ludovic ;
Gill, Navdeep ;
Kane, Nolan C. ;
Bowers, John E. ;
Hubner, Sariel ;
Bellec, Arnaud ;
Berard, Aurelie ;
Berges, Helene ;
Blanchet, Nicolas ;
Boniface, Marie-Claude ;
Brunel, Dominique ;
Catrice, Olivier ;
Chaidir, Nadia ;
Claudel, Clotilde ;
Donnadieu, Cecile ;
Faraut, Thomas ;
Fievet, Ghislain ;
Helmstetter, Nicolas ;
King, Matthew ;
Knapp, Steven J. ;
Lai, Zhao ;
Le Paslier, Marie-Christine ;
Lippi, Yannick ;
Lorenzon, Lolita ;
Mandel, Jennifer R. ;
Marage, Gwenola ;
Marchand, Gwenaelle ;
Marquand, Elodie ;
Bret-Mestries, Emmanuelle ;
Morien, Evan ;
Nambeesan, Savithri ;
Thuy Nguyen ;
Pegot-Espagnet, Prune ;
Pouilly, Nicolas ;
Raftis, Frances ;
Sallet, Erika ;
Schiex, Thomas ;
Thomas, Justine ;
Vandecasteele, Celine ;
Vares, Didier .
NATURE, 2017, 546 (7656) :148-+
[4]   GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses [J].
Besemer, J ;
Borodovsky, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W451-W454
[5]   Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome [J].
Bickhart, Derek M. ;
Rosen, Benjamin D. ;
Koren, Sergey ;
Sayre, Brian L. ;
Hastie, Alex R. ;
Chan, Saki ;
Lee, Joyce ;
Lam, Ernest T. ;
Liachko, Ivan ;
Sullivan, Shawn T. ;
Burton, Joshua N. ;
Huson, Heather J. ;
Nystrom, John C. ;
Kelley, Christy M. ;
Hutchison, Jana L. ;
Zhou, Yang ;
Sun, Jiajie ;
Crisa, Alessandra ;
de Leon, F. Abel Ponce ;
Schwartz, John C. ;
Hammond, John A. ;
Waldbieser, Geoffrey C. ;
Schroeder, Steven G. ;
Liu, George E. ;
Dunham, Maitreya J. ;
Shendure, Jay ;
Sonstegard, Tad S. ;
Phillippy, Adam M. ;
Van Tassell, Curtis P. ;
Smith, Timothy P. L. .
NATURE GENETICS, 2017, 49 (04) :643-+
[6]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[7]   Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions [J].
Burton, Joshua N. ;
Adey, Andrew ;
Patwardhan, Rupali P. ;
Qiu, Ruolan ;
Kitzman, Jacob O. ;
Shendure, Jay .
NATURE BIOTECHNOLOGY, 2013, 31 (12) :1119-+
[8]  
Byrum J. R., 1995, Soybean Genetics Newsletter, V22, P181
[9]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[10]   Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory [J].
Chaisson, Mark J. ;
Tesler, Glenn .
BMC BIOINFORMATICS, 2012, 13