Efficient Computation of Sequence Mappability

被引:3
|
作者
Alzamel, Mai [1 ,2 ]
Charalampopoulos, Panagiotis [1 ]
Iliopoulos, Costas S. [1 ]
Kociumaka, Tomasz [3 ]
Pissis, Solon P. [1 ]
Radoszewski, Jakub [3 ]
Straszynski, Juliusz [3 ]
机构
[1] Kings Coll London, Dept Informat, London, England
[2] King Saud Univ, Dept Comp Sci, Riyadh, Saudi Arabia
[3] Univ Warsaw, Inst Informat, Warsaw, Poland
来源
STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2018 | 2018年 / 11147卷
关键词
Sequence mappability; Hamming distance; Genome sequencing; Longest common substring with k mismatches; Suffix tree; SUFFIX TREE CONSTRUCTION; ALGORITHM;
D O I
10.1007/978-3-030-00479-8_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sequence mappability is an important task in genome resequencing. In the (k, m)-mappability problem, for a given sequence T of length n, our goal is to compute a table whose ith entry is the number of indices j not equal i such that length-m substrings of T starting at positions i and j have at most k mismatches. Previous works on this problem focused on heuristic approaches to compute a rough approximation of the result or on the case of k = 1. We present several efficient algorithms for the general case of the problem. Our main result is an algorithm that works in O(n min{m(k), log(k+1)n}) time and O(n) space for k = O(1). It requires a careful adaptation of the technique of Cole et al. [STOC 2004] to avoid multiple counting of pairs of substrings. We also show O(n(2))- time algorithms to compute all results for a fixed m and all k = 0, ..., m or a fixed k and all m = k, ..., n - 1. Finally we show that the (k, m)mappability problem cannot be solved in strongly subquadratic time for k,m = circle minus(log n) unless the Strong Exponential Time Hypothesis fails.
引用
收藏
页码:12 / 26
页数:15
相关论文
共 50 条
  • [41] Efficient computation of the matrix cosine
    Sastre, Jorge
    Ibanez, Javier
    Ruiz, Pedro
    Defez, Emilio
    APPLIED MATHEMATICS AND COMPUTATION, 2013, 219 (14) : 7575 - 7585
  • [42] EFFICIENT COMPUTATION OF NETWORK SENSITIVITIES
    ELTURKY, FM
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1986, 33 (06): : 659 - 664
  • [43] Efficient computation of contrastive explanations
    Artelt, Andre
    Hammer, Barbara
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [44] Efficient check sum computation
    Research Disclosure, 1998, (414):
  • [45] Efficient computation of behavior strategies
    vonStengel, B
    GAMES AND ECONOMIC BEHAVIOR, 1996, 14 (02) : 220 - 246
  • [46] Efficient computation of recurrence diameters
    Kroening, D
    Strichman, O
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, 2003, 2575 : 298 - 309
  • [47] Efficient computation of the Airy propagators
    Ixaru, L. Gr.
    COMPUTER PHYSICS COMMUNICATIONS, 2007, 176 (11-12) : 637 - 641
  • [48] Efficient value of information computation
    Shachter, RD
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1999, : 594 - 601
  • [49] On the efficient computation of Lyapunov exponents
    Udwadia, FE
    vonBremen, HF
    CONTROL OF OSCILLATIONS AND CHAOS - 1997 1ST INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS 1-3, 1997, : 484 - 487
  • [50] Strengthening invariants for efficient computation
    Liu, YHA
    Stoller, SD
    Teitelbaum, T
    SCIENCE OF COMPUTER PROGRAMMING, 2001, 41 (02) : 139 - 172