Haplotype-resolved chromosomal-level genome assembly of Buzhaye (Microcos paniculata)

被引:1
作者
Liu, Detuan [1 ,2 ,3 ]
Tian, Xiaoling [4 ]
Shao, Shicheng [5 ]
Ma, Yongpeng [1 ,2 ]
Zhang, Rengang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Kunming Inst Bot, Yunnan Key Lab Integrat Conservat Plant Species Ex, Kunming 650201, Peoples R China
[2] Chinese Acad Sci, Kunming Inst Bot, CAS Key Lab Plant Divers & Biogeog East Asia, Kunming 650201, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[4] Yunnan Univ, Inst Int Rivers & Ecosecur, Kunming 650500, Peoples R China
[5] Chinese Acad Sci, CAS Key Lab Trop Forest Ecol, Xishuangbanna Trop Bot Garden, Mengla 666303, Peoples R China
关键词
ANNOTATION; PROVIDES; SYSTEM;
D O I
10.1038/s41597-023-02821-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microcos paniculata is a shrub used traditionally as folk medicine and to make herbal teas. Previous research into this species has mainly focused on its chemical composition and medicinal value. However, the lack of a reference genome limits the study of the molecular mechanisms of active compounds in this species. Here, we assembled a haplotype-resolved chromosome-level genome of M. paniculata based on PacBio HiFi and Hi-C data. The assembly contains two haploid genomes with sizes 399.43 Mb and 393.10 Mb, with contig N50 lengths of 43.44 Mb and 30.17 Mb, respectively. About 99.93% of the assembled sequences could be anchored to 18 pseudo-chromosomes. Additionally, a total of 482 Mb repeat sequences were identified, accounting for 60.76% of the genome. A total of 49,439 protein-coding genes were identified, of which 48,979 (99%) were functionally annotated. This haplotype-resolved chromosome-level assembly and annotation of M. paniculata will serve as a valuable resource for investigating the biosynthesis and genetic basis of active compounds in this species, as well as advancing evolutionary phylogenomic studies in Malvales.
引用
收藏
页数:10
相关论文
共 54 条
[1]  
[Anonymous], 2023, NCBI Sequence Read Archive
[2]  
[Anonymous], 2023, NCBI Assembly
[3]   Fast and sensitive protein alignment using DIAMOND [J].
Buchfink, Benjamin ;
Xie, Chao ;
Huson, Daniel H. .
NATURE METHODS, 2015, 12 (01) :59-60
[4]   TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data [J].
Chen, Chengjie ;
Chen, Hao ;
Zhang, Yi ;
Thomas, Hannah R. ;
Frank, Margaret H. ;
He, Yehua ;
Xia, Rui .
MOLECULAR PLANT, 2020, 13 (08) :1194-1202
[5]   Araport11: a complete reannotation of the Arabidopsis thaliana reference genome [J].
Cheng, Chia-Yi ;
Krishnakumar, Vivek ;
Chan, Agnes P. ;
Thibaud-Nissen, Francoise ;
Schobel, Seth ;
Town, Christopher D. .
PLANT JOURNAL, 2017, 89 (04) :789-804
[6]   Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm [J].
Cheng, Haoyu ;
Concepcion, Gregory T. ;
Feng, Xiaowen ;
Zhang, Haowen ;
Li, Heng .
NATURE METHODS, 2021, 18 (02) :170-+
[7]   Genome sequence of the agarwood tree Aquilaria sinensis (Lour.) Spreng: the first chromosome-level draft genome in the Thymelaeceae family [J].
Ding, Xupo ;
Mei, Wenli ;
Lin, Qiang ;
Wang, Hao ;
Wang, Jun ;
Peng, Shiqing ;
Li, Huiliang ;
Zhu, Jiahong ;
Li, Wei ;
Wang, Pei ;
Chen, Huiqin ;
Dong, Wenhua ;
Guo, Dong ;
Cai, Caihong ;
Huang, Shengzhuo ;
Cui, Peng ;
Dai, Haofu .
GIGASCIENCE, 2020, 9 (03)
[8]  
Doyle J.J., 1987, BULLETIN, V19, P11
[9]   De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds [J].
Dudchenko, Olga ;
Batra, Sanjit S. ;
Omer, Arina D. ;
Nyquist, Sarah K. ;
Hoeger, Marie ;
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Lander, Eric S. ;
Aiden, Aviva Presser ;
Aiden, Erez Lieberman .
SCIENCE, 2017, 356 (6333) :92-95
[10]   Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom [J].
Durand, Neva C. ;
Robinson, James T. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Mesirov, Jill P. ;
Lander, Eric S. ;
Aiden, Erez Lieberman .
CELL SYSTEMS, 2016, 3 (01) :99-101