The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing

被引:30
作者
Wee, YongKiat [1 ]
Bhyan, Salma Begum [1 ]
Liu, Yining [2 ]
Lu, Jiachun [3 ]
Li, Xiaoyan [4 ]
Zhao, Min [1 ]
机构
[1] Univ Sunshine Coast, Sippy Downs, Qld, Australia
[2] Guangzhou Med Univ, Guangzhou, Guangdong, Peoples R China
[3] Guangzhou Med Univ, Sch Publ Hlth, Affiliated Hosp 1, Guangzhou, Guangdong, Peoples R China
[4] Capital Med Univ, Beijing Anzhen Hosp, Beijing, Peoples R China
关键词
third-generation sequencing; genome assembly; mapping; genome sequencing; HYBRID ERROR-CORRECTION; SINGLE-MOLECULE; LONG READS; ACCURATE; EFFICIENT;
D O I
10.1093/bfgp/ely037
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The application of third-generation sequencing (TGS) technology in genetics and genomics have provided opportunities to categorize and explore the individual genomic landscapes and mutations relevant for diagnosis and therapy using whole genome sequencing and de novo genome assembly. In general, the emerging TGS technology can produce high quality long reads for the determination of overlapping reads and transcript isoforms. However, this technology still faces challenges such as the accuracy for the identification of nucleotide bases and high error rates. Here, we surveyed 39 TGS-related tools for de novo assembly and genome analysis to identify the differences among their characteristics, such as the required input, the interaction with the user, sequencing platforms, type of reads, error models, the possibility of introducing coverage bias, the simulation of genomic variants and outputs provided. The decision trees are summarized to help researchers to find out the most suitable tools to analyze the TGS data. Our comprehensive survey and evaluation of computational features of existing methods for TGS may provide a valuable guideline for researchers.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 57 条
[1]   Limitations of next-generation genome sequence assembly [J].
Alkan, Can ;
Sajjadian, Saba ;
Eichler, Evan E. .
NATURE METHODS, 2011, 8 (01) :61-65
[2]  
Baker EAG, 2016, BIORXIV, P1, DOI [10.1101/076901, DOI 10.1101/076901]
[3]   RETRACTED: Evaluation of next-generation sequencing software in mapping and assembly (Retracted article. See vol. 56, pg. 687, 2011) [J].
Bao, Suying ;
Jiang, Rui ;
Kwan, WingKeung ;
Wang, BinBin ;
Ma, Xu ;
Song, You-Qiang .
JOURNAL OF HUMAN GENETICS, 2011, 56 (06) :406-414
[4]   Solexa Ltd [J].
Bennett, S .
PHARMACOGENOMICS, 2004, 5 (04) :433-438
[5]   Assembling large genomes with single-molecule sequencing and locality-sensitive hashing [J].
Berlin, Konstantin ;
Koren, Sergey ;
Chin, Chen-Shan ;
Drake, James P. ;
Landolin, Jane M. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2015, 33 (06) :623-+
[6]   DeepNano: Deep recurrent neural networks for base calling in MinION nanopore reads [J].
Boza, Vladimir ;
Brejova, Brona ;
Vinar, Tomas .
PLOS ONE, 2017, 12 (06)
[7]   Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions [J].
Cali, Damla Senol ;
Kim, Jeremie S. ;
Ghose, Saugata ;
Alkan, Can ;
Mutlu, Onur .
BRIEFINGS IN BIOINFORMATICS, 2019, 20 (04) :1542-1559
[8]   De novo assembly of a haplotype-resolved human genome [J].
Cao, Hongzhi ;
Wu, Honglong ;
Luo, Ruibang ;
Huang, Shujia ;
Sun, Yuhui ;
Tong, Xin ;
Xie, Yinlong ;
Liu, Binghang ;
Yang, Hailong ;
Zheng, Hancheng ;
Li, Jian ;
Li, Bo ;
Wang, Yu ;
Yang, Fang ;
Sun, Peng ;
Liu, Siyang ;
Gao, Peng ;
Huang, Haodong ;
Sun, Jing ;
Chen, Dan ;
He, Guangzhu ;
Huang, Weihua ;
Huang, Zheng ;
Li, Yue ;
Tellier, Laurent C. A. M. ;
Liu, Xiao ;
Feng, Qiang ;
Xu, Xun ;
Zhang, Xiuqing ;
Bolund, Lars ;
Krogh, Anders ;
Kristiansen, Karsten ;
Drmanac, Radoje ;
Drmanac, Snezana ;
Nielsen, Rasmus ;
Li, Songgang ;
Wang, Jian ;
Yang, Huanming ;
Li, Yingrui ;
Wong, Gane Ka-Shu ;
Wang, Jun .
NATURE BIOTECHNOLOGY, 2015, 33 (06) :617-+
[9]  
Chin CS, 2016, NAT METHODS, V13, P1050, DOI [10.1038/nmeth.4035, 10.1038/NMETH.4035]
[10]  
Chin CS, 2013, NAT METHODS, V10, P563, DOI [10.1038/nmeth.2474, 10.1038/NMETH.2474]