EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing

被引:2
|
作者
Cosart, Ted [1 ]
Beja-Pereira, Albano [2 ]
Luikart, Gordon [3 ]
机构
[1] Univ Montana, Div Biol Sci, Missoula, MT 59812 USA
[2] Univ Porto, Ctr Invest Biodiversidade & Recursos Genet CIBIO, P-4485661 Vairao, Portugal
[3] Univ Montana, Div Biol Sci, Flathead Lake Biol Stn, Polson, MT 59860 USA
基金
美国国家科学基金会;
关键词
bioinformatics; exon capture; exon sequences; next-generation sequencing; CAPTURE; BLAST;
D O I
10.1111/1755-0998.12267
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The computer program exonsampler automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of exonsampler to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected similar to 10% of the exome (similar to 3 million bp), including 155 candidate genes, and similar to 16000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection.
引用
收藏
页码:1296 / 1301
页数:6
相关论文
共 50 条
  • [1] GENOME-WIDE AND TARGETED NEXT-GENERATION SEQUENCING, AND INSIGHTS TO THE LENS
    Jamieson, Robyn V.
    CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2013, 41 : 112 - 112
  • [2] Genome-wide gene–gene interaction analysis for next-generation sequencing
    Jinying Zhao
    Yun Zhu
    Momiao Xiong
    European Journal of Human Genetics, 2016, 24 : 421 - 428
  • [3] Genome-wide gene-gene interaction analysis for next-generation sequencing
    Zhao, Jinying
    Zhu, Yun
    Xiong, Momiao
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2016, 24 (03) : 421 - 428
  • [4] Genome-wide Association Study Using Next-generation Sequencing in Spinach
    Shi, Ainong
    Qin, Jun
    Mou, Beiquan
    Correll, Jim
    Weng, Yuejin
    Feng, Chunda
    Motes, Dennis
    Yang, Wei
    Bhattarai, Gehendra
    Ravelombola, Waltram Second
    Dong, Lingdi
    Sugihara, Yuichi
    HORTSCIENCE, 2017, 52 (09) : S359 - S360
  • [5] Efficiently identifying genome-wide changes with next-generation sequencing data
    Huang, Weichun
    Umbach, David M.
    Jordan, Nicole Vincent
    Abell, Amy N.
    Johnson, Gary L.
    Li, Leping
    NUCLEIC ACIDS RESEARCH, 2011, 39 (19)
  • [6] Genome-Wide Copy Number Variation and Targeted Next-Generation Sequencing Studies of Merkel Cell Carcinoma
    Carter, M.
    Gaston, D.
    Huang, W.
    Greer, W.
    Pasternak, S.
    Ly, T.
    Walsh, N. M.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2017, 19 (06): : 1012 - 1013
  • [7] Genome-wide genetic marker discovery and genotyping using next-generation sequencing
    Davey, John W.
    Hohenlohe, Paul A.
    Etter, Paul D.
    Boone, Jason Q.
    Catchen, Julian M.
    Blaxter, Mark L.
    NATURE REVIEWS GENETICS, 2011, 12 (07) : 499 - 510
  • [8] Genome-wide genetic marker discovery and genotyping using next-generation sequencing
    John W. Davey
    Paul A. Hohenlohe
    Paul D. Etter
    Jason Q. Boone
    Julian M. Catchen
    Mark L. Blaxter
    Nature Reviews Genetics, 2011, 12 : 499 - 510
  • [9] Utilization of a Targeted Next-Generation Sequencing Assay for Assessment of Tumor Cellularity, and Genome-Wide and Gene-Specific Loss of Heterozygosity (LOH)
    Gupta, M.
    Sadis, S.
    Veitch, J.
    Bandla, S.
    Conner, K.
    Cyanam, D.
    El-Difrawy, S.
    Ewing, A.
    Kaznadzey, D.
    Kilzer, J.
    Kraltcheva, A.
    Mittal, V.
    Tseng, Y.
    Van Loy, C.
    Williams, P.
    Tom, W.
    Yang, C.
    Au-Young, J.
    Asuncion, L.
    Hyland, F.
    Wong-Ho, E.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2020, 22 (11): : S67 - S67
  • [10] Genome diagnostics: next-generation sequencing, new genome-wide association studies and clinical challenges
    Ziogas, Dimosthenis E.
    Roukos, Dimitrios H.
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2011, 11 (07) : 663 - 666