A survey of transcriptome complexity using full-length isoform sequencing in the tea plant Camellia sinensis

Cited by: 2
Authors
Ma, Dongna [1 ,2 ]
Fang, Jingping [3 ]
Ding, Qiansu [2 ]
Wei, Liufeng [2 ]
Li, Yiying [4 ]
Zhang, Liwen [4 ]
Zhang, Xingtan [1 ]
Affiliations
[1] Chinese Acad Agr Sci, Shenzhen Branch, Guangdong Lab Lingnan Modern Agr, Genome Anal Lab,Minist Agr & Rural Affairs,Agr Ge, Shenzhen, Peoples R China
[2] Xiamen Univ, Key Lab, Minist Educ Coastal & Wetland Ecosyst, Coll Environm & Ecol, Xiamen, Peoples R China
[3] Fujian Normal Univ, Coll Life Sci, Fuzhou, Peoples R China
[4] Fujian Agr & Forestry Univ, Coll Life Sci, Fuzhou, Peoples R China
Keywords
Tea plant; PacBio Iso-Seq; LncRNA; Alternative splicing; Alternative polyadenylation; Catechins; LONG NONCODING RNA; DNA METHYLATION; BRASSICA-CAMPESTRIS; MESSENGER-RNA; GENE; POLYADENYLATION; SEQ; EXPRESSION; LANDSCAPE; ALIGNMENT;
DOI
10.1007/s00438-022-01913-2
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Tea is one of the most popular beverages, and its leaves are rich in catechins, which contribute to its diverse flavors and are beneficial to human health. However, the post-transcriptional regulatory mechanisms affecting catechin synthesis remain insufficiently studied. Here, we sequenced the transcriptome using PacBio sequencing technology and obtained 63,111 full-length high-quality isoforms, including 1302 potential novel genes and 583 highly reliable fusion transcripts. We also identified 1204 high-quality lncRNAs, comprising 188 known and 1016 novel lncRNAs. In addition, 311 mis-annotated genes were corrected based on the high-quality Iso-Seq reads. We analyzed 3784 alternative splicing (AS) events and 18,714 alternative polyadenylation (APA) genes, accounting for 8.84% and 43.7% of the total annotated genes, respectively. We also found that 2884 genes with AS and APA features exhibited higher expression levels than other genes. These genes are mainly involved in amino acid biosynthesis, carbon fixation in photosynthetic organisms, phenylalanine, tyrosine, and tryptophan biosynthesis, and pyruvate metabolism, suggesting that they play an essential role in the catechin content of tea polyphenols. Our results improve the genome annotation and indicate that post-transcriptional regulation plays a crucial part in catechin synthesis.
Pages: 1243 - 1255
Page count: 13
Related papers
50 in total
  • [1] A survey of transcriptome complexity using full-length isoform sequencing in the tea plant Camellia sinensis
    Dongna Ma
    Jingping Fang
    Qiansu Ding
    Liufeng Wei
    Yiying Li
    Liwen Zhang
    Xingtan Zhang
    Molecular Genetics and Genomics, 2022, 297 : 1243 - 1255
  • [2] Full-length isoform concatenation sequencing to resolve cancer transcriptome complexity
    Wijeratne, Saranga
    Gonzalez, Maria E. Hernandez
    Roach, Kelli
    Miller, Katherine E.
    Schieffer, Kathleen M.
    Fitch, James R.
    Leonard, Jeffrey
    White, Peter
    Kelly, Benjamin J.
    Cottrell, Catherine E.
    Mardis, Elaine R.
    Wilson, Richard K.
    Miller, Anthony R.
    BMC GENOMICS, 2024, 25 (01)
  • [3] Cloning and Sequencing of a Full-Length cDNA Encoding the RuBPCase Small Subunit (RbcS) in Tea (Camellia sinensis)
    Ye Ai-hua
    Jiang Chang-jun
    Zhu Lin
    Yu Mei
    Wang Zhao-xia
    Deng Wei-wei
    Wei Chao-lin
    AGRICULTURAL SCIENCES IN CHINA, 2009, 8 (02) : 161 - 166
  • [4] The full-length transcriptome of C. elegans using direct RNA sequencing
    Roach, Nathan P.
    Sadowski, Norah
    Alessi, Amelia F.
    Timp, Winston
    Taylor, James
    Kim, John K.
    GENOME RESEARCH, 2020, 30 (02) : 299 - 312
  • [5] Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
    Li, Runsheng
    Ren, Xiaoliang
    Ding, Qiutao
    Bi, Yu
    Xie, Dongying
    Zhao, Zhongying
    GENOME RESEARCH, 2020, 30 (02) : 287 - 298
  • [6] A global survey of full-length transcriptome of Ginkgo biloba reveals transcript variants involved in flavonoid biosynthesis
    Ye, Jiabao
    Cheng, Shuiyuan
    Zhou, Xian
    Chen, Zexiong
    Kim, Soo Un
    Tan, Junping
    Zheng, Jiarui
    Xu, Feng
    Zhang, Weiwei
    Liao, Yongling
    Zhu, Yongxing
    INDUSTRIAL CROPS AND PRODUCTS, 2019, 139
  • [7] A survey of the full-length transcriptome of Gracilariopsis lemaneiformis using single-molecule long-read sequencing
    Chen, Xiaojiao
    Tang, Yue Yao
    Yin, Haodong
    Sun, Xue
    Zhang, Xiaoqian
    Xu, Nianjun
    BMC PLANT BIOLOGY, 2022, 22 (01)
  • [8] Comprehensive identification of the full-length transcripts and alternative splicing related to the secondary metabolism pathways in the tea plant (Camellia sinensis)
    Qiao, Dahe
    Yang, Chun
    Chen, Juan
    Guo, Yan
    Li, Yan
    Niu, Suzhen
    Cao, Kemei
    Chen, Zhengwu
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [9] Plant ISOform sequencing database (PISO): a comprehensive repertory of full-length transcripts in plants
    Feng, Jia-Wu
    Huang, Shanshan
    Guo, Yi-Xiong
    Liu, Dongxu
    Song, Jia-Ming
    Gao, Junxiang
    Li, Huan
    Chen, Ling-Ling
    PLANT BIOTECHNOLOGY JOURNAL, 2019, 17 (06) : 1001 - 1003
  • [10] Full-length transcriptome sequencing provides insights into flavonoid biosynthesis in Camellia nitidissima Petals
    Liu, Hexia
    Liu, Qin
    Chen, Yuling
    Zhu, Yulin
    Zhou, Xingwen
    Li, Bo
    GENE, 2023, 850