SOAP2: an improved ultrafast tool for short read alignment

Cited by: 2845
Authors
Li, Ruiqiang [1, 2]
Yu, Chang [1]
Li, Yingrui [1]
Lam, Tak-Wah [3]
Yiu, Siu-Ming [3]
Kristiansen, Karsten [2]
Wang, Jun [1, 2]
Affiliations
[1] Beijing Genom Inst Shenzhen, Shenzhen 518083, Peoples R China
[2] Univ So Denmark, Dept Biochem & Mol Biol, DK-5230 Odense M, Denmark
[3] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
GENOME; DNA;
DOI
10.1093/bioinformatics/btp336
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
SOAP2 is a significantly improved version of the short oligonucleotide alignment program that both reduces computer memory usage and increases alignment speed at an unprecedented rate. We replaced the seed strategy with a Burrows-Wheeler Transform (BWT) compression index for indexing the reference sequence in main memory. Tested on the whole human genome, the new algorithm reduced memory usage from 14.7 GB to 5.4 GB and improved alignment speed by 20-30 times. SOAP2 is compatible with both single- and paired-end reads. Additionally, the tool now supports multiple text and compressed file formats. A consensus builder has also been developed for consensus assembly and SNP detection from the alignment of short reads to a reference genome.
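To illustrate the idea of a BWT index mentioned in the abstract, the following is a minimal Python sketch of exact read matching by backward search over a Burrows-Wheeler transformed reference. It is not SOAP2's implementation: the function names (`build_bwt`, `build_fm_index`, `exact_match_positions`) are hypothetical, the suffix array is built naively, and the occurrence table is stored uncompressed, so it shows only the lookup principle and none of the memory savings or mismatch handling of a real aligner.

```python
from collections import Counter


def build_bwt(text):
    """Return (bwt, suffix_array) for text, using a naive suffix sort.

    A '$' sentinel is appended; it sorts before A, C, G and T.
    """
    text += "$"
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # The BWT character for each suffix is the character preceding it.
    bwt = "".join(text[i - 1] for i in sa)
    return bwt, sa


def build_fm_index(bwt):
    """Return (first, occ) lookup tables for backward search.

    first[c]  : number of characters in the text smaller than c
    occ[c][i] : number of occurrences of c in bwt[:i]
    """
    counts = Counter(bwt)
    first, total = {}, 0
    for ch in sorted(counts):
        first[ch] = total
        total += counts[ch]
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, ch in enumerate(bwt):
        for c in occ:
            occ[c][i + 1] = occ[c][i] + (1 if c == ch else 0)
    return first, occ


def exact_match_positions(read, first, occ, sa):
    """Return sorted 0-based reference positions where read occurs exactly."""
    lo, hi = 0, len(sa)  # half-open interval of matching suffix-array rows
    for ch in reversed(read):  # extend the match one base to the left
        if ch not in first:
            return []
        lo = first[ch] + occ[ch][lo]
        hi = first[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[i] for i in range(lo, hi))


reference = "ACGTACGTTACG"  # toy reference, chosen for illustration
bwt, sa = build_bwt(reference)
first, occ = build_fm_index(bwt)
print(exact_match_positions("ACG", first, occ, sa))  # → [0, 4, 9]
```

Each step of the search costs two table lookups per base of the read, independent of the reference length, which is the property that makes BWT-based indexes attractive for short read alignment.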
Pages: 1966-1967
Number of pages: 2