Chromosome-level genome assembly and characterization of the Calophaca sinica genome

被引:0
作者
Cao, Jianting [2 ]
Zhu, Hui [1 ]
Gao, Yingqi [3 ]
Hu, Yue [3 ]
Li, Xuejiao [3 ]
Shi, Jianwei [3 ]
Chen, Luqin [2 ]
Kang, Hao [2 ]
Ru, Dafu [1 ]
Ren, Baoqing [2 ]
Liu, Bingbing [3 ]
机构
[1] Taiyuan Bot Garden, Taiyuan, Peoples R China
[2] Lanzhou Univ, Coll Ecol, State Key Lab Grassland Agroecosyst, Lanzhou, Peoples R China
[3] Shanxi Univ, Inst Loess Plateau, Taiyuan, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Calophaca sinica; Fabaceae; gene duplication; WGD; TRD; READ ALIGNMENT; GENE FAMILY; SEQUENCE; TOOL; EVOLUTION; ANNOTATION; PROVIDES; PROGRAM; CONSEQUENCES; DUPLICATION;
D O I
10.1093/dnares/dsae011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Calophaca sinica is a rare plant endemic to northern China which belongs to the Fabaceae family and possesses rich nutritional value. To support the preservation of the genetic resources of this plant, we have successfully generated a high-quality genome of C. sinica (1.06 Gb). Notably, transposable elements (TEs) constituted similar to 73% of the genome, with long terminal repeat retrotransposons (LTR-RTs) dominating this group of elements (similar to 54% of the genome). The average intron length of the C. sinica genome was noticeably longer than what has been observed for closely related species. The expansion of LTR-RTs and elongated introns emerged had the largest influence on the enlarged genome size of C. sinica in comparison to other Fabaceae species. The proliferation of TEs could be explained by certain modes of gene duplication, namely, whole genome duplication (WGD) and dispersed duplication (DSD). Gene family expansion, which was found to enhance genes associated with metabolism, genetic maintenance, and environmental stress resistance, was a result of transposed duplicated genes (TRD) and WGD. The presented genomic analysis sheds light on the genetic architecture of C. sinica, as well as provides a starting point for future evolutionary biology, ecology, and functional genomics studies centred around C. sinica and closely related species.
引用
收藏
页数:12
相关论文
共 87 条
[1]   TEclass-a tool for automated classification of unknown eukaryotic transposable elements [J].
Abrusan, Gyorgy ;
Grundmann, Norbert ;
DeMester, Luc ;
Makalowski, Wojciech .
BIOINFORMATICS, 2009, 25 (10) :1329-1330
[2]   A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide [J].
Allen, G. C. ;
Flores-Vergara, M. A. ;
Krasnyanski, S. ;
Kumar, S. ;
Thompson, W. F. .
NATURE PROTOCOLS, 2006, 1 (05) :2320-2325
[3]   The universal protein resource (UniProt) [J].
Bairoch, A ;
Apweiler, R ;
Wu, CH ;
Barker, WC ;
Boeckmann, B ;
Ferro, S ;
Gasteiger, E ;
Huang, HZ ;
Lopez, R ;
Magrane, M ;
Martin, MJ ;
Natale, DA ;
O'Donovan, C ;
Redaschi, N ;
Yeh, LSL .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D154-D159
[4]   Patterns in grass genome evolution [J].
Bennetzen, Jeffrey L. .
CURRENT OPINION IN PLANT BIOLOGY, 2007, 10 (02) :176-181
[5]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[6]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[7]   trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[8]   Review: Mitogen-Activated Protein Kinases in nutritional signaling in Arabidopsis [J].
Chardin, Camille ;
Schenk, Sebastian T. ;
Hirt, Heribert ;
Colcombet, Jean ;
Krapp, Anne .
PLANT SCIENCE, 2017, 260 :101-108
[9]   fastp: an ultra-fast all-in-one FASTQ preprocessor [J].
Chen, Shifu ;
Zhou, Yanqing ;
Chen, Yaru ;
Gu, Jia .
BIOINFORMATICS, 2018, 34 (17) :884-890
[10]  
Chinese Botanical Committee of the Chinese Academy of Sciencces, 1993, Flora Reipublicae Popularis Sinicae: Calophaca Fisch, V42, P67