Another lesson from unmapped reads: in-depth analysis of RNA-Seq reads from various horse tissues

被引:0
|
作者
Artur Gurgul
Tomasz Szmatoła
Ewa Ocłoń
Igor Jasielczuk
Ewelina Semik-Gurgul
Carrie J. Finno
Jessica L. Petersen
Rebecca Bellone
Erin N. Hales
Tomasz Ząbek
Zbigniew Arent
Małgorzata Kotula-Balak
Monika Bugno-Poniewierska
机构
[1] University of Agriculture in Krakow,Center for Experimental and Innovative Medicine
[2] National Research Institute of Animal Production,Department of Animal Molecular Biology
[3] University of California Davis School of Veterinary Medicine,Department of Population Health and Reproduction
[4] University of Nebraska Lincoln,Department of Animal Science
[5] University of California Davis School of Veterinary Medicine,Veterinary Genetics Laboratory
[6] University of Agriculture in Krakow,University Centre of Veterinary Medicine
[7] University of Agriculture in Kraków,Department of Animal Reproduction, Anatomy and Genomics
来源
Journal of Applied Genetics | 2022年 / 63卷
关键词
Equine; Misassembled; Transcriptome; Genome assembly;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, a vast amount of sequencing data has been generated and large improvements have been made to reference genome sequences. Despite these advances, significant portions of reads still do not map to reference genomes and these reads have been considered as junk or artificial sequences. Recent studies have shown that these reads can be useful, e.g., for refining reference genomes or detecting contaminating microorganisms present in the analyzed biological samples. A special case of this is RNA sequencing (RNA-Seq) reads that come from tissue transcriptomes. Unmapped reads from RNA-Seq have received much less attention than those from whole-genome sequencing. In particular, in the horse, an analysis of unmapped RNA reads has not been performed yet. Thus, in this study, we analyzed the unmapped reads originating from the RNA-Seq performed through the Functional Annotation of Animal Genomes (FAANG) project in the horse, using eight different tissues from two mares. We demonstrated that unmapped reads from RNA-Seq could be easily assembled into transcripts relating to many important genes present in the sequences of other mammals. Large portions of these transcripts did not have coding potential and, thus, can be considered as non-coding RNA. Moreover, reads that were not mapped to the reference genome but aligned to the entries in NCBI database of horse proteins were enriched for biological processes that largely correspond to the functions of organ from which RNA was isolated and thus are presumably true transcripts of genes associated with cell metabolism in those tissues. In addition, a portion of reads aligned to the common pathogenic or neutral microbiota, of which the most common was Brucella spp. These data suggest that unmapped reads can be an important target for in-depth analysis that may substantially enrich results of initial RNA-Seq experiments for various tissues and organs.
引用
收藏
页码:571 / 581
页数:10
相关论文
共 37 条
  • [1] Another lesson from unmapped reads: in-depth analysis of RNA-Seq reads from various horse tissues
    Gurgul, Artur
    Szmatola, Tomasz
    Oclon, Ewa
    Jasielczuk, Igor
    Semik-Gurgul, Ewelina
    Finno, Carrie J.
    Petersen, Jessica L.
    Bellone, Rebecca
    Hales, Erin N.
    Zabek, Tomasz
    Arent, Zbigniew
    Kotula-Balak, Malgorzata
    Bugno-Poniewierska, Monika
    JOURNAL OF APPLIED GENETICS, 2022, 63 (03) : 571 - 581
  • [2] kakapo: easy extraction and annotation of genes from raw RNA-seq reads
    Ramanauskas, Karolis
    Igic, Boris
    PEERJ, 2023, 11
  • [3] Differential Expression Analysis of RNA-seq Reads: Overview, Taxonomy, and Tools
    Chowdhury, Hussain Ahmed
    Bhattacharyya, Dhruba Kumar
    Kalita, Jugal Kumar
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 566 - 586
  • [4] IsoTree: A New Framework for de novo Transcriptome Assembly from RNA-seq Reads
    Zhao, Jin
    Feng, Haodi
    Zhu, Daming
    Zhang, Chi
    Xu, Ying
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (03) : 938 - 948
  • [5] An optimized protocol for generation and analysis of Ion Proton sequencing reads for RNA-Seq
    Yuan, Yongxian
    Xu, Huaiqian
    Leung, Ross Ka-Kit
    BMC GENOMICS, 2016, 17
  • [6] An optimized protocol for generation and analysis of Ion Proton sequencing reads for RNA-Seq
    Yongxian Yuan
    Huaiqian Xu
    Ross Ka-Kit Leung
    BMC Genomics, 17
  • [7] CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
    Song, Li
    Sabunciyan, Sarven
    Florea, Liliana
    NUCLEIC ACIDS RESEARCH, 2016, 44 (10)
  • [8] Baiting out a full length sequence from unmapped RNA-seq data
    Li, Dongwei
    Huang, Qitong
    Huang, Lei
    Wen, Jikai
    Luo, Jing
    Li, Qing
    Peng, Yanling
    Zhang, Yubo
    BMC GENOMICS, 2021, 22 (01)
  • [9] An analysis of 67 RNA-seq datasets from various tissues at different stages of a model insect, Manduca sexta
    Cao, Xiaolong
    Jiang, Haobo
    BMC GENOMICS, 2017, 18
  • [10] An analysis of 67 RNA-seq datasets from various tissues at different stages of a model insect, Manduca sexta
    Xiaolong Cao
    Haobo Jiang
    BMC Genomics, 18