Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

被引:6
|
作者
Wang, Anqi [1 ,2 ]
Au, Kin Fai [1 ,2 ,3 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Univ Iowa, Dept Internal Med, Iowa City, IA 52242 USA
[3] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
关键词
SINGLE-MOLECULE; NANOPORE; GENOME; ACCURACY;
D O I
10.1186/s13059-019-1885-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We also demonstrate that the original error rate of 19% is the limit for perfect correction, beyond which long reads are too error-prone to be corrected by these methods.
引用
收藏
页数:8
相关论文
共 10 条
  • [1] A comparative evaluation of hybrid error correction methods for error-prone long reads
    Fu, Shuhua
    Wang, Anqi
    Au, Kin Fai
    GENOME BIOLOGY, 2019, 20 (1)
  • [2] Automated assembly of centromeres from ultra-long error-prone reads
    Bzikadze, Andrey, V
    Pevzner, Pavel A.
    NATURE BIOTECHNOLOGY, 2020, 38 (11) : 1309 - +
  • [3] Jabba: hybrid error correction for long sequencing reads
    Miclotte, Giles
    Heydari, Mahdi
    Demeester, Piet
    Rombauts, Stephane
    Van de Peer, Yves
    Audenaert, Pieter
    Fostier, Jan
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2016, 11
  • [4] HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning
    Choudhury, Olivia
    Chakrabarty, Ankush
    Emrich, Scott J.
    SCIENTIFIC REPORTS, 2018, 8
  • [5] Efficient Hybrid De Novo Error Correction and Assembly for Long Reads
    Kchouk, Mehdi
    Elloumi, Mourad
    2016 27TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2016, : 88 - 92
  • [6] Hybrid Error Correction approach and DeNovo Assembly for MinIon Sequencing Long Reads
    Kchouk, Mehdi
    Elloumi, Mourad
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 122 - 125
  • [7] Ratatosk: hybrid error correction of long reads enables accurate variant calling and assembly
    Holley, Guillaume
    Beyter, Doruk
    Ingimundardottir, Helga
    Moller, Peter L.
    Kristmundsdottir, Snodis
    Eggertsson, Hannes P.
    Halldorsson, Bjarni, V
    GENOME BIOLOGY, 2021, 22 (01)
  • [8] Symbiont-screener: A reference-free tool to separate host sequences from symbionts for error-prone long reads
    Xu, Mengyang
    Guo, Lidong
    Qi, Yanwei
    Shi, Chengcheng
    Liu, Xiaochuan
    Chen, Jianwei
    Han, Jinglin
    Deng, Li
    Liu, Xin
    Fan, Guangyi
    FRONTIERS IN MARINE SCIENCE, 2023, 10
  • [9] deSALT: fast and accurate long transcriptomic read alignment with de Bruijn graph-based index
    Liu, Bo
    Liu, Yadong
    Li, Junyi
    Guo, Hongzhe
    Zang, Tianyi
    Wang, Yadong
    GENOME BIOLOGY, 2019, 20 (01)
  • [10] A hybrid deep learning model based on feature capture of water level influencing factors and prediction error correction for water level prediction of cascade hydropower stations under multiple time scales
    Ma, Xin
    Hu, Hao
    Ren, Yufeng
    JOURNAL OF HYDROLOGY, 2023, 617