Exploiting single-molecule transcript sequencing for eukaryotic gene prediction

被引:0
作者
André E. Minoche
Juliane C. Dohm
Jessica Schneider
Daniela Holtgräwe
Prisca Viehöver
Magda Montfort
Thomas Rosleff Sörensen
Bernd Weisshaar
Heinz Himmelbauer
机构
[1] Max Planck Institute for Molecular Genetics,Department of Biology/Center for Biotechnology
[2] Centre for Genomic Regulation (CRG),undefined
[3] Universitat Pompeu Fabra (UPF),undefined
[4] University of Natural Resources and Life Sciences (BOKU),undefined
[5] Bielefeld University,undefined
来源
Genome Biology | / 16卷
关键词
Eukaryotic gene prediction; Single-molecule real-time sequencing; mRNA-seq; Caryophyllales; Sugar beet; Spinach; Non-model species; Genome annotation;
D O I
暂无
中图分类号
学科分类号
摘要
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is developed as automated process. Optimized training and prediction settings and mRNA-seq noise reduction of assisting Illumina reads results in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource to obtain comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.
引用
收藏
相关论文
共 43 条
[41]   PacBio single molecule real-time sequencing of a full-length transcriptome of the greenfin horse-faced filefish Thamnaconus modestus [J].
Li, Qingfei ;
Wang, Na ;
Sui, Chao ;
Mao, Huadong ;
Zhang, Lu ;
Chen, Jinghua .
FRONTIERS IN MARINE SCIENCE, 2022, 9
[42]   Highly sensitive detection of mutations in CHO cell recombinant DNA using multi-parallel single molecule real-time DNA sequencing [J].
Cartwright, Joseph F. ;
Anderson, Karin ;
Longworth, Joseph ;
Lobb, Philip ;
James, David C. .
BIOTECHNOLOGY AND BIOENGINEERING, 2018, 115 (06) :1485-1498
[43]   The novel HLA-B*44 allele, HLA-B*44:220, identified by Single Molecule Real-Time DNA sequencing in a British Caucasoid male [J].
Hayward, D. R. ;
Bultitude, W. P. ;
Mayor, N. P. ;
Madrigal, J. A. ;
Marsh, S. G. E. .
TISSUE ANTIGENS, 2015, 86 (01) :61-63