An End-to-end Oxford Nanopore Basecaller Using Convolution-augmented Transformer

Times cited: 16
Authors
Lv, Xuan [1]
Chen, Zhiguang [2]
Lu, Yutong [2]
Yang, Yuedong [2]
Affiliations
[1] Natl Univ Def Technol, Sch Comp Sci, Changsha, Peoples R China
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
Source
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2020
Funding
National Natural Science Foundation of China;
Keywords
Oxford Nanopore sequencing; Third-party Basecaller; Attention; Dynamic convolution; NEURAL-NETWORK; GENOME;
DOI
10.1109/BIBM49941.2020.9313290
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Oxford Nanopore sequencing is rapidly becoming an active field in genomics, and basecalling nucleotide sequences from the complex electrical signals is a critical step. Many efforts have been devoted to developing new basecalling tools over the years. However, basecalled reads still suffer from a high error rate, and basecalling remains slow. Here, we developed an open-source basecalling method, CATCaller, that simultaneously captures global context through attention and models local dependencies through dynamic convolution. The method was shown to consistently outperform the ONT default basecallers Albacore and Guppy, as well as a recently developed attention-based method, SACall, in read accuracy. More importantly, our method is fast thanks to a heterogeneous computing model that integrates both CPUs and GPUs. Compared to SACall, the method is nearly 4 times faster on a single GPU and is highly scalable in parallelization, with a further speedup of 3.3 on a four-GPU node.
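The pairing of attention (global context) with dynamic convolution (local dependencies) described in the abstract can be illustrated with a minimal NumPy sketch. This is not the CATCaller implementation: the function names, the single attention head, the channel split into two halves, and the random weights are all assumptions chosen for illustration, and the record does not state how the paper combines the two branches.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x, wq, wk, wv):
    """Single-head scaled dot-product attention over all time steps."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = (q @ k.T) / np.sqrt(q.shape[-1])
    return softmax(scores, axis=-1) @ v


def dynamic_conv(x, w_kernel, width):
    """Convolution whose kernel is predicted from the input at each step."""
    if width % 2 == 0:
        raise ValueError("width must be odd so the window is centred")
    steps, _ = x.shape
    # One softmax-normalised kernel per time step, shape (steps, width).
    kernels = softmax(x @ w_kernel, axis=-1)
    pad = width // 2
    padded = np.pad(x, ((pad, pad), (0, 0)))  # zero-pad the time axis
    out = np.empty_like(x)
    for t in range(steps):
        # Weighted sum over the local window centred on step t.
        out[t] = kernels[t] @ padded[t:t + width]
    return out


def hybrid_block(x, wq, wk, wv, w_kernel, width):
    """Hypothetical combination: half the channels go to each branch."""
    half = x.shape[1] // 2
    global_part = self_attention(x[:, :half], wq, wk, wv)
    local_part = dynamic_conv(x[:, half:], w_kernel, width)
    return np.concatenate([global_part, local_part], axis=1)


# Toy input standing in for an embedded signal chunk (assumed sizes).
rng = np.random.default_rng(0)
steps, channels, width = 8, 4, 3
half = channels // 2
signal = rng.normal(size=(steps, channels))
features = hybrid_block(
    signal,
    rng.normal(size=(half, half)),
    rng.normal(size=(half, half)),
    rng.normal(size=(half, half)),
    rng.normal(size=(half, width)),
    width,
)
print(features.shape)  # (8, 4)
```

In this sketch the attention branch lets every time step weigh every other step, while the dynamic convolution only mixes a short window whose weights depend on the current step, which is one way to read the abstract's global/local distinction.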
Pages: 337-342
Number of pages: 6