De novo genome assembly of the potent medicinal plant Rehmannia glutinosa using nanopore technology

被引:24
作者
Ma, Ligang [1 ,2 ]
Dong, Chengming [1 ,2 ]
Song, Chi [3 ]
Wang, Xiaolan [1 ,2 ]
Zheng, Xiaoke [1 ,2 ]
Niu, Yan [4 ]
Chen, Shilin [3 ]
Feng, Weisheng [1 ,2 ]
机构
[1] Henan Univ Chinese Med, Coll Pharm, Zhengzhou 450046, Henan, Peoples R China
[2] Coconstruct Collaborat Innovat Ctr Chinese Med &, Henan & Educ Minist PR China, Zhengzhou 450046, Henan, Peoples R China
[3] Inst Chinese Materia Med, China Acad Chinese Med Sci, Key Lab Beijing Identificat & Safety Evaluat Chin, Beijing 100700, Peoples R China
[4] Wuhan Benagen Tech Solut Co Ltd, Wuhan 430070, Hubei, Peoples R China
关键词
Rehmannia glutinosa; Genome sequence; Oxford Nanopore Technology; Medicinal plant; Genome evolution; PHYLOGENETIC ANALYSIS; ANNOTATION; BIOSYNTHESIS; EVOLUTION; FAMILY; GENES; TOOL; GLYCOSIDE; IRIDOIDS; CATALPOL;
D O I
10.1016/j.csbj.2021.07.006
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Rehmannia glutinosa is a potent medicinal plant with a significant importance in traditional Chinese medicine. Its root is enriched with various bioactive molecules mainly iridoids, possessing important pharmaceutical properties. However, the molecular biology and evolution of R. glutinosa have been largely unexplored. Here, we report a reference genome of R. glutinosa using Nanopore technology, Illumina and Hi-C sequencing. The assembly genome is 2.49 Gb long with a scaffold N50 length of 70 Mb and high heterozygosity (2%). Since R. glutinosa is an autotetraploid (4n = 56), the difference between each set of chromosomes is very small, and it is difficult to distinguish the two sets of chromosomes using Hi-C. Hence, only one set of the genome size was mounted to the chromosome level. Scaffolds covering 52.61% of the assembled genome were anchored on 14 pseudochromosomes. Over 67% of the genome consists of repetitive sequences dominated by Copia long terminal repeats and 48,475 protein-coding genes were predicted. Phylogenetic analysis corroborates the placement of R. glutinosa in the Orobanchaceae family. Our results indicated an independent and very recent whole genome duplication event that occurred 3.64 million year ago in the R. glutinosa lineage. Comparative genomics analysis demonstrated expansion of the UDP-dependent glycosyltransferases and terpene synthase gene families, known to be involved in terpenoid biosynthesis and diversification. Furthermore, the molecular biosyn-thetic pathway of iridoids has been clarified in this work. Collectively, the generated reference genome of R. glutinosa will facilitate discovery and development of important pharmacological compounds. (C) 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Bio-technology.
引用
收藏
页码:3954 / 3963
页数:10
相关论文
共 104 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] Opportunities and challenges in long-read sequencing data analysis
    Amarasinghe, Shanika L.
    Su, Shian
    Dong, Xueyi
    Zappia, Luke
    Ritchie, Matthew E.
    Gouil, Quentin
    [J]. GENOME BIOLOGY, 2020, 21 (01)
  • [3] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] Catalpol protects mesencephalic neurons against MPTP induced neurotoxicity via attenuation of mitochondrial dysfunction and MAO-B activity
    Bi, Jing
    Wang, Xiao-bo
    Chen, Lei
    Hao, Shuang
    An, Li-jia
    Jiang, Bo
    Guo, Lei
    [J]. TOXICOLOGY IN VITRO, 2008, 22 (08) : 1883 - 1889
  • [6] Byng JW, 2016, BOT J LINN SOC, V181, P1, DOI [10.1111/boj.12385, 10.1111/j.1095-8339.2009.00996.x]
  • [7] MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes
    Cantarel, Brandi L.
    Korf, Ian
    Robb, Sofia M. C.
    Parra, Genis
    Ross, Eric
    Moore, Barry
    Holt, Carson
    Alvarado, Alejandro Sanchez
    Yandell, Mark
    [J]. GENOME RESEARCH, 2008, 18 (01) : 188 - 196
  • [8] Chakraborty Prasanta, 2018, Biochim Open, V6, P9, DOI 10.1016/j.biopen.2017.12.003
  • [9] The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom
    Chen, Feng
    Tholl, Dorothea
    Bohlmann, Joerg
    Pichersky, Eran
    [J]. PLANT JOURNAL, 2011, 66 (01) : 212 - 229
  • [10] Genome sequence of the olive tree, Olea europaea
    Cruz, Fernando
    Julca, Irene
    Gomez-Garrido, Jessica
    Loska, Damian
    Marcet-Houben, Marina
    Cano, Emilio
    Galan, Beatriz
    Frias, Leonor
    Ribeca, Paolo
    Derdak, Sophia
    Gut, Marta
    Sanchez-Fernandez, Manuel
    Luis Garcia, Jose
    Gut, Ivo G.
    Vargas, Pablo
    Alioto, Tyler S.
    Gabaldon, Toni
    [J]. GIGASCIENCE, 2016, 5