An Efficient Filtration Method Based on Variable-Length Seeds for Sequence Alignment

被引:0
|
作者
Guo, Ruidong [1 ,2 ]
Cheng, Haoyu [1 ,2 ]
Xu, Yun [1 ,2 ]
机构
[1] Univ Sci & Technol China, Comp Sch Comp Sci, Key Lab High Performance, Hefei 230027, Anhui, Peoples R China
[2] Natl Univ Def Technol, Collaborat Innovat Ctr High Performance Comp, Changsha 410073, Hunan, Peoples R China
来源
PARALLEL ARCHITECTURE, ALGORITHM AND PROGRAMMING, PAAP 2017 | 2017年 / 729卷
关键词
Read alignment; Bitmapper; Filtration; All mapper; READ ALIGNMENT; ALGORITHM; ACCURATE;
D O I
10.1007/978-981-10-6442-5_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of next-generation sequencing (NGS) platforms, more than billions of reads are produced quickly. Finding all mapping locations of these reads in the reference genome is not only a bioinformatics issue, but also a large-scale computation issue. Existing all mapping tools are usually divided into the two steps, filtration and verification. Filtration step discards some wrong locations and generates candidates. As for verification step, each candidate is mapped to the reference sequence to determine whether it is a mapping location. Statistics indicated that the verification step is the main part of the whole mapping time. That is to say, less candidates lead to less mapping time. Our strategies improve filtration step to decrease the number of candidates. We propose a dynamic programming and two heuristic strategies and integrated them into the filtration step. These strategies are applied in the state-of-the-art all-mapper, Bitmapper. Compared with the advanced all-mappers, experiment results show that our method make a significant progress.
引用
收藏
页码:214 / 223
页数:10
相关论文
empty
未找到相关数据