Effective Identification and Annotation of Fungal Genomes

被引:0
作者
Jian Liu
Jia-Liang Sun
Yong-Zhuang Liu
机构
[1] Nankai University,College of Computer Science
[2] Harbin Institute of Technology,School of Computer Science and Technology
来源
Journal of Computer Science and Technology | 2021年 / 36卷
关键词
fungal genome; fungal identification; bioinformatics pipeline;
D O I
暂无
中图分类号
学科分类号
摘要
In the past few decades, the dangers of mycosis have caused widespread concern. With the development of the sequencing technology, the effective analysis of fungal sequencing data has become a hotspot. With the gradual increase of fungal sequencing data, there is now a lack of sufficient approaches for the identification and functional annotation of fungal chromosomal genomes. To overcome this challenge, this paper firstly deals with the approaches of the identification and annotation of fungal genomes based on short and long reads sequenced by using multiple platforms such as Illumina and Pacbio. Then this paper develops an automated bioinformatics pipeline called PFGI for the identification and annotation task. The experimental evaluation on a real-world dataset ENA (European Nucleotide Archive) shows that PFGI provides a user-friendly way to perform fungal identification and annotation based on the sequencing data analysis, and could provide accurate analyzing results, accurate to the species level (97% sequence identity).
引用
收藏
页码:248 / 260
页数:12
相关论文
共 55 条
[1]  
Schuster SC(2008)Next-generation sequencing transforms today’s biology Nature Methods 5 16-18
[2]  
van Dijk EL(2014)Ten years of next-generation sequencing technology Trends in Genetics 30 418-426
[3]  
Auger H(2018)The third revolution in sequencing technology Trends in Genetics 34 666-681
[4]  
Jaszczyszyn Y(2014)Fungal high-throughput taxonomic identification tool for use with next-generation sequencing (FHiTINGS) Journal of Basic Microbiology 54 315-321
[5]  
Thermes C(2015)PIPITS: An automated pipeline for analyses of fungal internal transcribed spacer sequences from the I llumina sequencing platform Methods in Ecology and Evolution 6 973-980
[6]  
van Dijk EL(2014)Prokka: Rapid prokaryotic genome annotation Bioinformatics 30 2068-2069
[7]  
Jaszczyszyn Y(2018)FASTQ: An ultra-fast all-in-one FASTQ preprocessor Bioinformatics 34 i884-i890
[8]  
Naquin D(2014)Trimmomatic: A flexible trimmer for Illumina sequence data Bioinformatics 30 2114-2120
[9]  
Thermes C(2015)MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph Bioinformatics 31 1674-1676
[10]  
Dannemiller KC(2008)Velvet: Algorithms for de novo short read assembly using de Bruijn graphs Genome Research 18 821-829