Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data

被引:16
作者
Cullen, Jonah N. [1 ]
Friedenberg, Steven G. [1 ]
机构
[1] Univ Minnesota, Coll Vet Med, Dept Vet Clin Sci, 1352 Boyd Ave, St Paul, MN 55108 USA
基金
美国农业部;
关键词
whole genome sequencing; pipeline; variants;
D O I
10.1093/g3journal/jkad117
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Advancements in massively parallel short-read sequencing technologies and the associated decreasing costs have led to large and diverse variant discovery efforts across species. However, processing high-throughput short-read sequencing data can be challenging with potential pitfalls and bioinformatics bottlenecks in generating reproducible results. Although a number of pipelines exist that address these challenges, these are often geared toward human or traditional model organism species and can be difficult to configure across institutions. Whole Animal Genome Sequencing (WAGS) is an open-source set of user-friendly, containerized pipelines designed to simplify the process of identifying germline short (SNP and indel) and structural variants (SVs) geared toward the veterinary community but adaptable to any species with a suitable reference genome. We present a description of the pipelines [adapted from the best practices of the Genome Analysis Toolkit (GATK)], along with benchmarking data from both the preprocessing and joint genotyping steps, consistent with a typical user workflow.
引用
收藏
页数:6
相关论文
共 41 条
[31]  
Pedersen B. S., 2020, SMOOVE STRUCTURAL VA
[32]   DELLY: structural variant discovery by integrated paired-end and split-read analysis [J].
Rausch, Tobias ;
Zichner, Thomas ;
Schlattl, Andreas ;
Stuetz, Adrian M. ;
Benes, Vladimir ;
Korbel, Jan O. .
BIOINFORMATICS, 2012, 28 (18) :I333-I339
[33]   Strong signatures of selection in the domestic pig genome [J].
Rubin, Carl-Johan ;
Megens, Hendrik-Jan ;
Barrio, Alvaro Martinez ;
Maqbool, Khurram ;
Sayyab, Shumaila ;
Schwochow, Doreen ;
Wang, Chao ;
Carlborg, Orjan ;
Jern, Patric ;
Jorgensen, Claus B. ;
Archibald, Alan L. ;
Fredholm, Merete ;
Groenen, Martien A. M. ;
Andersson, Leif .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (48) :19529-19536
[34]   Tandem duplication within the DMD gene in Labrador retrievers with a mild clinical phenotype [J].
Shelton, G. Diane ;
Minor, Katie M. ;
Vieira, Natassia M. ;
Kunkel, Louis M. ;
Friedenberg, Steven G. ;
Cullen, Jonah N. ;
Guo, Ling T. ;
Zatz, Mayana ;
Mickelson, James R. .
NEUROMUSCULAR DISORDERS, 2022, 32 (10) :836-841
[35]   An EHPB1L1 Nonsense Mutation Associated with Congenital Dyserythropoietic Anemia and Polymyopathy in Labrador Retriever Littermates [J].
Shelton, G. Diane ;
Minor, Katie M. ;
Guo, Ling T. ;
Thomas-Hollands, Alison ;
Walsh, Koranda A. ;
Friedenberg, Steven G. ;
Cullen, Jonah N. ;
Mickelson, James R. .
GENES, 2022, 13 (08)
[36]   Muscular dystrophy-dystroglycanopathy in a family of Labrador retrievers with a LARGE1 mutation [J].
Shelton, G. Diane ;
Minor, Katie M. ;
Guo, Ling T. ;
Friedenberg, Steven G. ;
Cullen, Jonah N. ;
Hord, Jeffrey M. ;
Venzke, David ;
Anderson, Mary E. ;
Devereaux, Megan ;
Prouty, Sally J. ;
Handelman, Caryl ;
Campbell, Kevin P. ;
Mickelson, James R. .
NEUROMUSCULAR DISORDERS, 2021, 31 (11) :1169-1178
[37]   Measuring dementia carers' unmet need for services - an exploratory mixed method study [J].
Stirling, Christine ;
Andrews, Sharon ;
Croft, Toby ;
Vickers, James ;
Turner, Paul ;
Robinson, Andrew .
BMC HEALTH SERVICES RESEARCH, 2010, 10
[38]  
Van der Auwera GA, 2020, Genomics in the Cloud: Using Docker, GATK, and WDL in Terra
[39]   A novel canine reference genome resolves genomic architecture and uncovers transcript complexity [J].
Wang, Chao ;
Wallerman, Ola ;
Arendt, Maja-Louise ;
Sundstrom, Elisabeth ;
Karlsson, Asa ;
Nordin, Jessika ;
Makelainen, Suvi ;
Pielberg, Gerli Rosengren ;
Hanson, Jeanette ;
Ohlsson, Asa ;
Saellstrom, Sara ;
Ronnberg, Henrik ;
Ljungvall, Ingrid ;
Haggstrom, Jens ;
Bergstrom, Tomas F. ;
Hedhammar, Ake ;
Meadows, Jennifer R. S. ;
Lindblad-Toh, Kerstin .
COMMUNICATIONS BIOLOGY, 2021, 4 (01)
[40]   863 genomes reveal the origin and domestication of chicken [J].
Wang, Ming-Shan ;
Thakur, Mukesh ;
Peng, Min-Sheng ;
Jiang, Yu ;
Frantz, Laurent Alain Francois ;
Li, Ming ;
Zhang, Jin-Jin ;
Wang, Sheng ;
Peters, Joris ;
Otecko, Newton Otieno ;
Suwannapoom, Chatmongkon ;
Guo, Xing ;
Zheng, Zhu-Qing ;
Esmailizadeh, Ali ;
Hirimuthugoda, Nalini Yasoda ;
Ashari, Hidayat ;
Suladari, Sri ;
Zein, Moch Syamsul Arifin ;
Kusza, Szilvia ;
Sohrabi, Saeed ;
Kharrati-Koopaee, Hamed ;
Shen, Quan-Kuan ;
Zeng, Lin ;
Yang, Min-Min ;
Wu, Ya-Jiang ;
Yang, Xing-Yan ;
Lu, Xue-Mei ;
Jia, Xin-Zheng ;
Nie, Qing-Hua ;
Lamont, Susan Joy ;
Lasagna, Emiliano ;
Ceccobelli, Simone ;
Gunwardana, Humpita Gamaralalage Thilini Nisanka ;
Senasige, Thilina Madusanka ;
Feng, Shao-Hong ;
Si, Jing-Fang ;
Zhang, Hao ;
Jin, Jie-Qiong ;
Li, Ming-Li ;
Liu, Yan-Hu ;
Chen, Hong-Man ;
Ma, Cheng ;
Dai, Shan-Shan ;
Bhuiyan, Abul Kashem Fazlul Haque ;
Khan, Muhammad Sajjad ;
Silva, Gamamada Liyanage Lalanie Pradeepa ;
Thi-Thuy Le ;
Mwai, Okeyo Ally ;
Ibrahim, Mohamed Nawaz Mohamed ;
Supple, Megan .
CELL RESEARCH, 2020, 30 (08) :693-701