Exploiting single-molecule transcript sequencing for eukaryotic gene prediction

Cited by: 0
Authors
André E. Minoche
Juliane C. Dohm
Jessica Schneider
Daniela Holtgräwe
Prisca Viehöver
Magda Montfort
Thomas Rosleff Sörensen
Bernd Weisshaar
Heinz Himmelbauer
Affiliations
[1] Max Planck Institute for Molecular Genetics, Department of Biology/Center for Biotechnology
[2] Centre for Genomic Regulation (CRG)
[3] Universitat Pompeu Fabra (UPF)
[4] University of Natural Resources and Life Sciences (BOKU)
[5] Bielefeld University
Source
Genome Biology, Volume 16
Keywords
Eukaryotic gene prediction; Single-molecule real-time sequencing; mRNA-seq; Caryophyllales; Sugar beet; Spinach; Non-model species; Genome annotation
DOI
Not available
Abstract
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is developed as an automated process. Optimized training and prediction settings, together with mRNA-seq noise reduction of the assisting Illumina reads, result in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource for obtaining comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.