ReSeqTools: an integrated toolkit for large-scale next-generation sequencing based resequencing analysis

Cited by: 35
Authors
He, W. [1, 2]
Zhao, S. [2]
Liu, X. [2]
Dong, S. [2]
Lv, J. [2]
Liu, D. [2]
Wang, J. [1, 2]
Meng, Z. [1]
Affiliations
[1] China Univ Technol, Sch Biosci & Bioengn, Guangzhou, Guangdong, Peoples R China
[2] BGI Shenzhen, Shenzhen, Peoples R China
Keywords
Next-generation sequencing; Resequencing; Toolkit; Sequence variation; Genome sequence; Alignment
DOI
10.4238/2013.December.4.15
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Large-scale next-generation sequencing (NGS)-based resequencing detects sequence variations, constructs evolutionary histories, and identifies phenotype-related genotypes. However, NGS-based resequencing studies generate extraordinarily large amounts of data, making computations difficult. Effective use and analysis of these data for NGS-based resequencing studies remains a difficult task for individual researchers. Here, we introduce ReSeqTools, a full-featured toolkit for NGS (Illumina sequencing)-based resequencing analysis, which processes raw data, interprets mapping results, and identifies and annotates sequence variations. ReSeqTools provides abundant scalable functions for routine resequencing analysis in different modules to facilitate customization of the analysis pipeline. ReSeqTools is designed to use compressed data files as input or output to save storage space and facilitates faster and more computationally efficient large-scale resequencing studies in a user-friendly manner. It offers abundant practical functions and generates useful statistics during the analysis pipeline, which significantly simplifies resequencing analysis. Its integrated algorithms and abundant sub-functions provide a solid foundation for special demands in resequencing projects. Users can combine these functions to construct their own pipelines for other purposes.
Pages: 6275-6283
Page count: 9