Optimizing and benchmarking de novo transcriptome sequencing: from library preparation to assembly evaluation
Cited by: 61
Authors:
Hara, Yuichiro [1]
Tatsumi, Kaori [1]
Yoshida, Michio [2]
Kajikawa, Eriko [2]
Kiyonari, Hiroshi [3, 4]
Kuraku, Shigehiro [1]
Affiliations:
[1] RIKEN Ctr Life Sci Technol, Phyloinformat Unit, Chuo Ku, Kobe, Hyogo 6500047, Japan
[2] RIKEN Ctr Dev Biol, Lab Vertebrate Body Plan, Chuo Ku, Kobe, Hyogo 6500047, Japan
[3] RIKEN Ctr Life Sci Technol, Anim Resource Dev Unit, Chuo Ku, Kobe, Hyogo 6500047, Japan
[4] RIKEN Ctr Life Sci Technol, Genet Engn Team, Chuo Ku, Kobe, Hyogo 6500047, Japan
Background: RNA-seq enables gene expression profiling in selected spatiotemporal windows and yields massive sequence information with relatively low cost and time investment, even for non-model species. However, considerable room remains for optimizing its workflow to take full advantage of continuously developing sequencing capacity.
Method: Transcriptome sequencing for three embryonic stages of the Madagascar ground gecko (Paroedura picta) was performed with the Illumina platform. The output reads were assembled de novo to reconstruct transcript sequences. To evaluate the completeness of the transcriptome assemblies, we prepared a reference gene set consisting of vertebrate one-to-one orthologs.
Result: To take advantage of the increased read length of >150 nt, we demonstrated that a shortened RNA fragmentation time results in a dramatic shift in the insert size distribution. To evaluate the products of multiple de novo assembly runs incorporating reads that differ in RNA source, read length, and insert size, we introduced a new reference gene set, core vertebrate genes (CVG), consisting of 233 genes that are shared as one-to-one orthologs by all vertebrate genomes examined (29 species). The completeness assessment performed by the computational pipelines CEGMA and BUSCO, referring to CVG, demonstrated higher accuracy and resolution than with the gene set previously established for this purpose. As a result of the assessment with CVG, we derived the most comprehensive transcript sequence set of the Madagascar ground gecko by assembling individual libraries and then clustering the assembled sequences based on their overall similarities.
Conclusion: Our results provide several insights into optimizing the de novo RNA-seq workflow, including the coordination between library insert size and read length, which manifested in improved connectivity of assemblies. The approach and assembly assessment with CVG demonstrated here would be applicable to transcriptome analysis of other species as well as to whole genome analyses.
Lebenzon, Jacqueline E.
Affiliation: Univ Western Ontario, Dept Biol, 1151 Richmond St N, London, ON N6A 3K7, Canada
Toxopeus, Jantina
Affiliation: Univ Western Ontario, Dept Biol, 1151 Richmond St N, London, ON N6A 3K7, Canada
Affiliation: St Francis Xavier Univ, Biol Dept, 2321 Notre Dame Ave, Antigonish, NS B2G 2W5, Canada
Anthony, Susan E.
Affiliation: Univ Western Ontario, Dept Biol, 1151 Richmond St N, London, ON N6A 3K7, Canada
Sinclair, Brent J.
Affiliation: Univ Western Ontario, Dept Biol, 1151 Richmond St N, London, ON N6A 3K7, Canada