Comprehensive annotation of the Chinese tree shrew genome by large-scale RNA sequencing and long-read isoform sequencing

Cited by: 26
Authors
Ye, Mao-Sen [1, 2]
Zhang, Jin-Yan [1, 2]
Yu, Dan-Dan [1, 3]
Xu, Min [1, 3]
Xu, Ling [1, 3]
Lv, Long-Bao [3]
Zhu, Qi-Yun [4]
Fan, Yu [1, 3]
Yao, Yong-Gang [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, Key Lab Anim Models & Human Dis Mech, Kunming Inst Zool, KIZ CUHK Joint Lab Bioresources & Mol Res Common, Kunming 650204, Yunnan, Peoples R China
[2] Univ Chinese Acad Sci, Kunming Coll Life Sci, Kunming 650204, Yunnan, Peoples R China
[3] Chinese Acad Sci, Natl Resource Ctr Nonhuman Primates, Kunming Inst Zool, Natl Res Facil Phenotyp & Genet Anal Model Anim P, Kunming 650107, Yunnan, Peoples R China
[4] Chinese Acad Agr Sci, Lanzhou Vet Res Inst, State Key Lab Vet Etiol Biol, Lanzhou 730046, Gansu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Tree shrew; Genome annotation; Transcriptome; Gene family; Virus infection; TUPAIA-BELANGERI; ANIMAL-MODELS; INDUCED MYOPIA; GENE; PROTEIN; FAMILY; TRANSCRIPTOME; BIOGENESIS; GENERATION; PRIMATES;
DOI
10.24272/j.issn.2095-8137.2021.272
Chinese Library Classification number
Q95 [Zoology];
Discipline classification code
071002;
Abstract
The Chinese tree shrew (Tupaia belangeri chinensis) is emerging as an important experimental animal in multiple fields of biomedical research. Comprehensive reference genome annotation for both mRNA and long non-coding RNA (lncRNA) is crucial for developing animal models using this species. In the current study, we collected a total of 234 high-quality RNA sequencing (RNA-seq) datasets and two long-read isoform sequencing (ISO-seq) datasets and improved the annotation of our previously assembled high-quality chromosome-level tree shrew genome. We obtained a total of 3 514 newly annotated coding genes and 50 576 lncRNA genes. We also characterized the tissue-specific expression patterns and alternative splicing patterns of mRNAs and lncRNAs and mapped the orthologous relationships among 11 mammalian species using the current annotated genome. We identified 144 tree shrew-specific gene families, including interleukin 6 (IL6) and STT3 oligosaccharyltransferase complex catalytic subunit B (STT3B), which underwent significant changes in size. Comparison of the overall expression patterns in tissues and pathways across four species (human, rhesus monkey, tree shrew, and mouse) indicated that tree shrews are more similar to primates than to mice at the tissue-transcriptome level. Notably, the newly annotated purine rich element binding protein A (PURA) gene and the STT3B gene family showed dysregulation upon viral infection. The updated version of the tree shrew genome annotation (KIZ version 3: TS_3.0) is available at http://www.treeshrewdb.org and provides an essential reference for basic and biomedical studies using tree shrew animal models.
Pages: 692 - 709
Number of pages: 18
Related papers
50 in total
  • [1] Comprehensive annotation of the Chinese tree shrew genome by large-scale RNA sequencing and long-read isoform sequencing
    Mao-Sen Ye
    Jin-Yan Zhang
    Dan-Dan Yu
    Min Xu
    Ling Xu
    Long-Bao Lv
    Qi-Yun Zhu
    Yu Fan
    Yong-Gang Yao
    Zoological Research, 2021, 42 (06) : 692 - 709
  • [2] Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing
    Cook, David E.
    Valle-Inclan, Jose Espejo
    Pajoro, Alice
    Rovenich, Hanna
    Thomma, Bart P. H. J.
    Faino, Luigi
    PLANT PHYSIOLOGY, 2019, 179 (01) : 38 - 54
  • [3] Genome sequencing using long-read sequencing
    McEwen, Juan Guillermo
    Gomez, Oscar Mauricio
    REVISTA DE LA ACADEMIA COLOMBIANA DE CIENCIAS EXACTAS FISICAS Y NATURALES, 2023, 47 (183) : 439 - 444
  • [4] Comprehensive characterization of single-cell isoform in mouse retina with long-read RNA sequencing
    Wang, Meng
    Oh, Soo
    Li, Yumei
    Cheng, Xuesen
    Wang, Jun
    Chen, Rui
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2023, 64 (08)
  • [5] Long-read sequencing and de novo assembly of a Chinese genome
    Shi, Lingling
    Guo, Yunfei
    Dong, Chengliang
    Huddleston, John
    Yang, Hui
    Han, Xiaolu
    Fu, Aisi
    Li, Quan
    Li, Na
    Gong, Siyi
    Lintner, Katherine E.
    Ding, Qiong
    Wang, Zou
    Hu, Jiang
    Wang, Depeng
    Wang, Feng
    Wang, Lin
    Lyon, Gholson J.
    Guan, Yongtao
    Shen, Yufeng
    Evgrafov, Oleg V.
    Knowles, James A.
    Thibaud-Nissen, Francoise
    Schneider, Valerie
    Yu, Chack-Yung
    Zhou, Libing
    Eichler, Evan E.
    So, Kwok-Fai
    Wang, Kai
    NATURE COMMUNICATIONS, 2016, 7
  • [6] Long-read sequencing and de novo assembly of a Chinese genome
    Lingling Shi
    Yunfei Guo
    Chengliang Dong
    John Huddleston
    Hui Yang
    Xiaolu Han
    Aisi Fu
    Quan Li
    Na Li
    Siyi Gong
    Katherine E. Lintner
    Qiong Ding
    Zou Wang
    Jiang Hu
    Depeng Wang
    Feng Wang
    Lin Wang
    Gholson J. Lyon
    Yongtao Guan
    Yufeng Shen
    Oleg V. Evgrafov
    James A. Knowles
    Francoise Thibaud-Nissen
    Valerie Schneider
    Chack-Yung Yu
    Libing Zhou
    Evan E. Eichler
    Kwok-Fai So
    Kai Wang
    Nature Communications, 7
  • [7] A comprehensive long-read isoform analysis platform and sequencing resource for breast cancer
    Veiga, Diogo F. T.
    Nesta, Alex
    Zhao, Yuqi
    Mays, Anne Deslattes
    Huynh, Richie
    Rossi, Robert
    Wu, Te-Chia
    Palucka, Karolina
    Anczukow, Olga
    Beck, Christine R.
    Banchereau, Jacques
    SCIENCE ADVANCES, 2022, 8 (03)
  • [8] Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data
    Su, Yaqi
    Yu, Zhejian
    Jin, Siqian
    Ai, Zhipeng
    Yuan, Ruihong
    Chen, Xinyi
    Xue, Ziwei
    Guo, Yixin
    Chen, Di
    Liang, Hongqing
    Liu, Zuozhu
    Liu, Wanlu
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [9] Comprehensive Characterization of AAV Vectors' Genome Integrity Through Long-Read Sequencing
    Chen, Ting
    Elliott, Kirk
    Zhang, Amanda
    Mayer, Ayda
    Jin, Lan
    Shi, Mi
    Danos, Olivier
    Liu, Ye
    MOLECULAR THERAPY, 2024, 32 (04) : 248 - 248
  • [10] Long-read RNA sequencing can probe organelle genome pervasive transcription
    Lima, Matheus Sanita
    Silva Domingues, Douglas
    Rossi Paschoal, Alexandre
    Smith, David Roy
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (06) : 695 - 701