Chromosome-level genome assembly of an endangered plant Prunus mongolica using PacBio and Hi-C technologies

Cited by: 4
Authors
Zhu, Qiang [1, 2]
Wang, Yali [2]
Yao, Ning [1]
Ni, Xilu [3]
Wang, Cuiping [4]
Wang, Meng [1]
Zhang, Lei [4]
Liang, Wenyu [1]
Affiliations
[1] Ningxia Univ, Sch Life Sci, Yinchuan 750021, Peoples R China
[2] Ningxia Forestry Inst, State Key Lab Efficient Prod Forest Resources, Yinchuan 750001, Peoples R China
[3] Ningxia Univ, Sch Ecol & Environm, Yinchuan 750021, Peoples R China
[4] North Minzu Univ, Coll Biol Sci & Engn, Yinchuan 750021, Peoples R China
Keywords
Prunus mongolica; endangered plant; chromosome-level genome; genome assembly; DE-NOVO IDENTIFICATION; PHYLOGENETIC ANALYSIS; GENE; PROTEIN; PROGRAM; ALIGNMENT; SEQUENCE; RESPONSES; FAMILIES; ACCURATE
DOI
10.1093/dnares/dsad012
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Prunus mongolica is an ecologically and economically important xerophytic tree native to Northwest China. Here, we report a high-quality, chromosome-level P. mongolica genome assembly that integrates PacBio high-fidelity sequencing and Hi-C technology. The assembled genome was 233.17 Mb in size, with 98.89% assigned to eight pseudochromosomes. The genome had contig and scaffold N50s of 24.33 Mb and 26.54 Mb, respectively, and a BUSCO completeness score of 98.76%; CEGMA indicated that 98.47% of the assembled genome was reliably annotated. The genome contained a total of 88.54 Mb (37.97%) of repetitive sequences and 23,798 protein-coding genes. We found that P. mongolica experienced two whole-genome duplications, with the most recent event occurring ~3.57 million years ago. Phylogenetic and chromosome syntenic analyses revealed that P. mongolica was closely related to P. persica and P. dulcis. Furthermore, we identified a number of candidate genes involved in drought tolerance and fatty acid biosynthesis. These candidate genes are likely to prove useful in studies of drought tolerance and fatty acid biosynthesis in P. mongolica, and will provide important genetic resources for molecular breeding and improvement experiments in Prunus species. This high-quality reference genome will also accelerate the study of the adaptation of xerophytic plants to drought.
Pages: 13