Full-length transcriptome sequencing reveals extreme incomplete annotation of the goat genome

被引:6
作者
Zhang, Huanhuan [1 ]
Liang, Yilin [1 ]
Chen, Shaomei [2 ]
Xuan, Zeyi [2 ]
Jiang, Yu [1 ]
Li, Ran [1 ,3 ]
Cao, Yanhong [2 ,4 ]
机构
[1] Northwest A&F Univ, Coll Anim Sci & Technol, Key Lab Anim Genet Breeding & Reprod Shaanxi Prov, Yangling, Peoples R China
[2] Guangxi Vocat Univ Agr, Inst Anim Husb, Nanning, Peoples R China
[3] Northwest A&F Univ, Coll Anim Sci & Technol, Key Lab Anim Genet Breeding & Reprod Shaanxi Prov, Yangling 712100, Shaanxi, Peoples R China
[4] Guangxi Vocat Univ Agr, Inst Anim Husb, Nanning 530000, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
full-length transcriptome sequencing; genome annotation; goat; Iso-Seq;
D O I
10.1111/age.13311
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Despite recent advances in generating high-quality reference genome assemblies, the genome sequences for most livestock species, including goats, are still poorly annotated. Single-molecule long-read sequencing has greatly facilitated gene annotation by obtaining full-length transcripts. In this study, we generated full-length transcriptome data for samples from abomasum (n = 2) and testicle (n = 1), using PacBio Iso-Seq technology. We further combined these data with published data from abomasum (5ZY, SRR8618141) to evaluate and improve the gene annotation of the goat genome. We identified 14.5-16.3% of novel genes per sample from the four Iso-Seq datasets. At the transcript level, 40.6% of them were novel, including 29.7% novel transcripts from known genes and 10.9% from novel genes. We further verified the expression of novel genes in four additional RNA-seq data and found that the expression level of novel genes was significantly lower than that of known genes, indicating that the lowly expressed genes tend to be missed in the current genome annotation. This study shows the superiority of full-length transcriptome data in gene annotation, and more such data are required to improve the gene annotation for goat genome and other species.
引用
收藏
页码:421 / 424
页数:4
相关论文
共 19 条
[1]   PacBio Iso-Seq Improves the Rainbow Trout Genome Annotation and Identifies Alternative Splicing Associated With Economically Important Phenotypes [J].
Ali, Ali ;
Thorgaard, Gary H. H. ;
Salem, Mohamed .
FRONTIERS IN GENETICS, 2021, 12
[2]   Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data [J].
Beiki, H. ;
Liu, H. ;
Huang, J. ;
Manchanda, N. ;
Nonneman, D. ;
Smith, T. P. L. ;
Reecy, J. M. ;
Tuggle, C. K. .
BMC GENOMICS, 2019, 20 (1)
[3]   Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome [J].
Bickhart, Derek M. ;
Rosen, Benjamin D. ;
Koren, Sergey ;
Sayre, Brian L. ;
Hastie, Alex R. ;
Chan, Saki ;
Lee, Joyce ;
Lam, Ernest T. ;
Liachko, Ivan ;
Sullivan, Shawn T. ;
Burton, Joshua N. ;
Huson, Heather J. ;
Nystrom, John C. ;
Kelley, Christy M. ;
Hutchison, Jana L. ;
Zhou, Yang ;
Sun, Jiajie ;
Crisa, Alessandra ;
de Leon, F. Abel Ponce ;
Schwartz, John C. ;
Hammond, John A. ;
Waldbieser, Geoffrey C. ;
Schroeder, Steven G. ;
Liu, George E. ;
Dunham, Maitreya J. ;
Shendure, Jay ;
Sonstegard, Tad S. ;
Phillippy, Adam M. ;
Van Tassell, Curtis P. ;
Smith, Timothy P. L. .
NATURE GENETICS, 2017, 49 (04) :643-+
[4]   The role of the goat in society: Past, present and perspectives for the future [J].
Boyazoglu, J ;
Hatziminologlou, I ;
Morand-Fehr, P .
SMALL RUMINANT RESEARCH, 2005, 60 (1-2) :13-23
[5]   fastp: an ultra-fast all-in-one FASTQ preprocessor [J].
Chen, Shifu ;
Zhou, Yanqing ;
Chen, Yaru ;
Gu, Jia .
BIOINFORMATICS, 2018, 34 (17) :884-890
[6]   STAR: ultrafast universal RNA-seq aligner [J].
Dobin, Alexander ;
Davis, Carrie A. ;
Schlesinger, Felix ;
Drenkow, Jorg ;
Zaleski, Chris ;
Jha, Sonali ;
Batut, Philippe ;
Chaisson, Mark ;
Gingeras, Thomas R. .
BIOINFORMATICS, 2013, 29 (01) :15-21
[7]   Reconstruction of the full-length transcriptome atlas using PacBio Iso-Seq provides insight into the alternative splicing in Gossypium australe [J].
Feng, Shouli ;
Xu, Min ;
Liu, Fujie ;
Cui, Changjiang ;
Zhou, Baoliang .
BMC PLANT BIOLOGY, 2019, 19 (01)
[8]   Full-length transcriptome assembly from RNA-Seq data without a reference genome [J].
Grabherr, Manfred G. ;
Haas, Brian J. ;
Yassour, Moran ;
Levin, Joshua Z. ;
Thompson, Dawn A. ;
Amit, Ido ;
Adiconis, Xian ;
Fan, Lin ;
Raychowdhury, Raktima ;
Zeng, Qiandong ;
Chen, Zehua ;
Mauceli, Evan ;
Hacohen, Nir ;
Gnirke, Andreas ;
Rhind, Nicholas ;
di Palma, Federica ;
Birren, Bruce W. ;
Nusbaum, Chad ;
Lindblad-Toh, Kerstin ;
Friedman, Nir ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2011, 29 (07) :644-U130
[9]   Full-length transcriptome reconstruction reveals genetic differences in hybrids of Oryza sativa and Oryza punctata with different ploidy and genome compositions [J].
He, Wenting ;
Zhang, Xianhua ;
Lv, Pincang ;
Wang, Wei ;
Wang, Jie ;
He, Yuchi ;
Song, Zhaojian ;
Cai, Detian .
BMC PLANT BIOLOGY, 2022, 22 (01)
[10]   Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences [J].
Li, Heng .
BIOINFORMATICS, 2016, 32 (14) :2103-2110