Chromosome genome assembly of the Camphora longepaniculata (Gamble) with PacBio and Hi-C sequencing data

被引:2
作者
Yan, Kuan [1 ,2 ]
Zhu, Hui [3 ]
Cao, Guiling [1 ,2 ]
Meng, Lina [1 ,2 ]
Li, Junqiang [1 ,2 ]
Zhang, Jian [1 ,2 ]
Liu, Sicen [1 ,2 ]
Wang, Yujie [1 ,2 ]
Feng, Ruizhang [1 ,2 ]
Soaud, Salma A. [4 ]
Abd Elhamid, Mohamed A. [4 ]
Heakel, Rania M. Y. [4 ]
Wei, Qin [1 ,2 ]
El-Sappah, Ahmed H. [1 ,2 ,4 ]
Ru, Dafu [3 ]
机构
[1] Yibin Univ, Fac Agr Forestry & Food Engn, Yibin, Peoples R China
[2] Yibin Univ, Sichuan Oil Cinnamon Engn Technol Res Ctr, Yibin, Peoples R China
[3] Lanzhou Univ, Coll Ecol, State Key Lab Herbage Improvement & Grassland Agro, Lanzhou, Peoples R China
[4] Zagazig Univ, Fac Agr, Genet Dept, Zagazig, Egypt
基金
中国国家自然科学基金;
关键词
Camphora longepaniculata; high-throughput sequencing; protein-coding genes; traditional Chinese medicine; terpenoid; TRANSCRIPTOME ANALYSIS; ACYL TRANSFERASE; FAMILY; ANNOTATION; ALIGNMENTS; PROGRAM; FINDER; GENES; TREE; TOOL;
D O I
10.3389/fpls.2024.1372127
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Introduction Camphora longepaniculata, a crucial commercial crop and a fundamental component of traditional Chinese medicine, is renowned for its abundant production of volatile terpenoids. However, the lack of available genomic information has hindered pertinent research efforts in the past. Methods To bridge this gap, the present study aimed to use PacBio HiFi, short-read, and highthroughput chromosome conformation capture sequencing to construct a chromosome-level assembly of the C. longepaniculata genome. Results and discussion With twelve chromosomes accounting for 99.82% (766.69 Mb) of the final genome assembly, which covered 768.10 Mb, it was very complete. Remarkably, the assembly's contig and scaffold N50 values are exceptional as well-41.12 and 63.78 Mb, respectively-highlighting its excellent quality and intact structure. Furthermore, a total of 39,173 protein-coding genes were predicted, with 38,766 (98.96%) of them being functionally annotated. The completeness of the genome was confirmed by the Benchmarking Universal Single-Copy Ortholog evaluation, which revealed 99.01% of highly conserved plant genes. As the first comprehensive assembly of the C. longepaniculata genome, it provides a crucial starting point for deciphering the complex pathways involved in terpenoid production. Furthermore, this excellent genome serves as a vital resource for upcoming research on the breeding and genetics of C. longepaniculata.
引用
收藏
页数:11
相关论文
共 68 条
[1]  
Abeysinghe PD., 2020, Cinnamon: Botany Agronomy Chem. Ind. Appl, P85, DOI [DOI 10.1007/978-3-030-54426-34, DOI 10.1007/978-3-030-54426-3_4, 10.1007/978-3-030-54426-3_4]
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[4]   Repbase Update, a database of repetitive elements in eukaryotic genomes [J].
Bao, Weidong ;
Kojima, Kenji K. ;
Kohany, Oleksiy .
MOBILE DNA, 2015, 6
[5]   Hi-C: A comprehensive technique to capture the conformation of genomes [J].
Belton, Jon-Matthew ;
McCord, Rachel Patton ;
Gibcus, Johan Harmen ;
Naumova, Natalia ;
Zhan, Ye ;
Dekker, Job .
METHODS, 2012, 58 (03) :268-276
[6]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[7]   Investigation of terpene diversification across multiple sequenced plant genomes [J].
Boutanaev, Alexander M. ;
Moses, Tessa ;
Zi, Jiachen ;
Nelson, David R. ;
Mugford, Sam T. ;
Peters, Reuben J. ;
Osbourn, Anne .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (01) :E81-E88
[8]   Stout camphor tree genome fills gaps in understanding of flowering plant genome evolution [J].
Chaw, Shu-Miaw ;
Liu, Yu-Ching ;
Wu, Yu-Wei ;
Wang, Han-Yu ;
Lin, Chan-Yi Ivy ;
Wu, Chung-Shien ;
Ke, Huei-Mien ;
Chang, Lo-Yu ;
Hsu, Chih-Yao ;
Yang, Hui-Ting ;
Sudianto, Edi ;
Hsu, Min-Hung ;
Wu, Kun-Pin ;
Wang, Ling-Ni ;
Leebens-Mack, James H. ;
Tsai, Isheng J. .
NATURE PLANTS, 2019, 5 (01) :63-73
[9]   TBtools-II: A "one for all, all for one"bioinformatics platform for biological big-data mining [J].
Chen, Chengjie ;
Wu, Ya ;
Li, Jiawei ;
Wang, Xiao ;
Zeng, Zaohai ;
Xu, Jing ;
Liu, Yuanlong ;
Feng, Junting ;
Chen, Hao ;
He, Yehua ;
Xia, Rui .
MOLECULAR PLANT, 2023, 16 (11) :1733-1742
[10]   The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom [J].
Chen, Feng ;
Tholl, Dorothea ;
Bohlmann, Joerg ;
Pichersky, Eran .
PLANT JOURNAL, 2011, 66 (01) :212-229