GenMap: ultra-fast computation of genome mappability

被引:100
|
作者
Pockrandt, Christopher [1 ,2 ,3 ,4 ]
Alzamel, Mai [5 ,6 ]
Iliopoulos, Costas S. [5 ]
Reinert, Knut [3 ,4 ]
机构
[1] Johns Hopkins Univ, Sch Med, Ctr Computat Biol, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Dept Biomed Engn, Baltimore, MD 21218 USA
[3] Free Univ Berlin, Dept Comp Sci & Math, Berlin, Germany
[4] Max Planck Inst Mol Genet, Dept Computat Mol Biol, Berlin, Germany
[5] Kings Coll London, Dept Informat, London, England
[6] King Saud Univ, Dept Comp Sci, Riyadh, Saudi Arabia
基金
美国国家卫生研究院;
关键词
RESOURCE; DNA;
D O I
10.1093/bioinformatics/btaa222
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Computing the uniqueness of k-mers for each position of a genome while allowing for up to e mismatches is computationally challenging. However, it is crucial for many biological applications such as the design of guide RNA for CRISPR experiments. More formally, the uniqueness or (k, e)-mappability can be described for every position as the reciprocal value of how often this k-mer occurs approximately in the genome, i.e. with up to e mismatches. Results: We present a fast method GenMap to compute the (k, e)-mappability. We extend the mappability algorithm, such that it can also be computed across multiple genomes where a k-mer occurrence is only counted once per genome. This allows for the computation of marker sequences or finding candidates for probe design by identifying approximate k-mers that are unique to a genome or that are present in all genomes. GenMap supports different formats such as binary output, wig and bed files as well as csv files to export the location of all approximate k-mers for each genomic position.
引用
收藏
页码:3687 / 3692
页数:6
相关论文
共 50 条
  • [1] Fast Computation and Applications of Genome Mappability
    Derrien, Thomas
    Estelle, Jordi
    Marco Sola, Santiago
    Knowles, David G.
    Raineri, Emanuele
    Guigo, Roderic
    Ribeca, Paolo
    PLOS ONE, 2012, 7 (01):
  • [2] Faster Computation of Genome Mappability
    Hooshmand, Sahar
    Abedin, Paniz
    Gibney, Daniel
    Aluru, Srinivas
    Thankachan, Sharma, V
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 537 - 537
  • [3] Faster Computation of Genome Mappability with one Mismatch
    Hooshmand, Sahar
    Abedin, Paniz
    Gibney, Daniel
    Aluru, Srinivas
    Thankachan, Sharma V.
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2018,
  • [4] Ultra-fast computation of fractal dimension for RGB images
    de Miras, Juan Ruiz
    Li, Yurong
    Leon, Alejandro
    Arroyo, German
    Lopez, Luis
    Torres, Juan Carlos
    Martin, Domingo
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [5] FLAK: Ultra-Fast Fuzzy Whole Genome Alignment
    Healy, John
    10TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2016, 477 : 123 - 131
  • [6] SpeedSeq: ultra-fast personal genome analysis and interpretation
    Colby Chiang
    Ryan M Layer
    Gregory G Faust
    Michael R Lindberg
    David B Rose
    Erik P Garrison
    Gabor T Marth
    Aaron R Quinlan
    Ira M Hall
    Nature Methods, 2015, 12 : 966 - 968
  • [7] SpeedSeq: ultra-fast personal genome analysis and interpretation
    Chiang, Colby
    Layer, Ryan M.
    Faust, Gregory G.
    Lindberg, Michael R.
    Rose, David B.
    Garrison, Erik P.
    Marth, Gabor T.
    Quinlan, Aaron R.
    Hall, Ira M.
    NATURE METHODS, 2015, 12 (10) : 966 - 968
  • [8] RRCA: Ultra-Fast Multiple In-species Genome Alignments
    Wandelt, Sebastian
    Leser, Ulf
    ALGORITHMS FOR COMPUTATIONAL BIOLOGY, 2014, 8542 : 247 - 261
  • [9] ULTRA-FAST COUNTERS
    NEDDERMEYER, SH
    PHYSICAL REVIEW, 1947, 72 (08): : 741 - 741
  • [10] ULTRA-FAST LASERS
    MIGUS, A
    ANTONETTI, A
    ORSZAG, A
    RECHERCHE, 1987, 18 (187): : 432 - 442