Exploiting single-molecule transcript sequencing for eukaryotic gene prediction

Cited by: 0
Authors
André E. Minoche
Juliane C. Dohm
Jessica Schneider
Daniela Holtgräwe
Prisca Viehöver
Magda Montfort
Thomas Rosleff Sörensen
Bernd Weisshaar
Heinz Himmelbauer
Affiliations
[1] Max Planck Institute for Molecular Genetics, Department of Biology/Center for Biotechnology
[2] Centre for Genomic Regulation (CRG)
[3] Universitat Pompeu Fabra (UPF)
[4] University of Natural Resources and Life Sciences (BOKU)
[5] Bielefeld University
Source
Genome Biology, Volume 16
Keywords
Eukaryotic gene prediction; Single-molecule real-time sequencing; mRNA-seq; Caryophyllales; Sugar beet; Spinach; Non-model species; Genome annotation
DOI
Not available
CLC classification number
Subject classification number
Abstract
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is implemented as an automated process. Optimized training and prediction settings, together with noise reduction of the supporting Illumina mRNA-seq reads, result in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource for obtaining comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.
Related papers
43 items in total
[21]   Full-length transcriptome analysis of the bloom-forming dinoflagellate Akashiwo sanguinea by single-molecule real-time sequencing [J].
Chen, Tiantian ;
Liu, Yun ;
Song, Shuqun ;
Bai, Jie ;
Li, Caiwen .
FRONTIERS IN MICROBIOLOGY, 2022, 13
[22]   Single-Molecule Real-Time (SMRT) Isoform Sequencing (Iso-Seq) in Plants: The Status of the Bioinformatics Tools to Unravel the Transcriptome Complexity [J].
Gao, Yubang ;
Xi, Feihu ;
Zhang, Hangxiao ;
Liu, Xuqing ;
Wang, Huiyuan ;
Zhao, Liangzhen ;
Reddy, Anireddy S. N. ;
Gu, Lianfeng .
CURRENT BIOINFORMATICS, 2019, 14 (07) :566-573
[23]   Detecting AGG Interruptions in Females With a FMR1 Premutation by Long-Read Single-Molecule Sequencing: A 1 Year Clinical Experience [J].
Ardui, Simon ;
Race, Valerie ;
de Ravel, Thomy ;
Van Esch, Hilde ;
Devriendt, Koenraad ;
Matthijs, Gert ;
Vermeesch, Joris R. .
FRONTIERS IN GENETICS, 2018, 9
[24]   Molecular Mechanism Underlying the Sorghum sudanense (Piper) Stapf. Response to Osmotic Stress Determined via Single-Molecule Real-Time Sequencing and Next-Generation Sequencing [J].
Liu, Qiuxu ;
Wang, Fangyan ;
Xu, Yalin ;
Lin, Chaowen ;
Li, Xiangyan ;
Xu, Wenzhi ;
Wang, Hong ;
Zhu, Yongqun .
PLANTS-BASEL, 2023, 12 (14)
[25]   A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms [J].
Scalzitti, Nicolas ;
Jeannin-Girardon, Anne ;
Collet, Pierre ;
Poch, Olivier ;
Thompson, Julie D. .
BMC GENOMICS, 2020, 21 (01)
[27]   Transcriptome Comparative Analysis of Salt Stress Responsiveness in Chrysanthemum (Dendranthema grandiflorum) Roots by Illumina- and Single-Molecule Real-Time-Based RNA Sequencing [J].
Zhao, Qian ;
He, Ling ;
Wang, Bei ;
Liu, Qing-Lin ;
Pan, Yuan-Zhi ;
Zhang, Fan ;
Jiang, Bei-Bei ;
Zhang, Lei ;
Liu, Guang-Li ;
Jia, Yin .
DNA AND CELL BIOLOGY, 2018, 37 (12) :1016-1030
[28]   Exploring the hepatitis C virus genome using single molecule real-time sequencing [J].
Haruhiko Takeda ;
Taiki Yamashita ;
Yoshihide Ueda ;
Akihiro Sekine .
World Journal of Gastroenterology, 2019, (32) :4661-4672
[29]   Assessment and refinement of eukaryotic gene structure prediction with gene-structure-aware multiple protein sequence alignment [J].
Gotoh, Osamu ;
Morita, Mariko ;
Nelson, David R. .
BMC BIOINFORMATICS, 2014, 15