Fast algorithms for computing the constrained LCS of run-length encoded strings

被引:5
作者
Ann, Hsing-Yen [1 ]
Yang, Chang-Biau [1 ]
Tseng, Chiou-Ting [1 ]
Hor, Chiou-Yi [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung 80424, Taiwan
关键词
Design of algorithms; Longest common subsequence; Run-length encoding; Constrained LCS; LONGEST COMMON SUBSEQUENCE;
D O I
10.1016/j.tcs.2012.01.038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The constrained LCS (CLCS) problem, a recent variant of the longest common subsequence (LCS) problem, has gained much attention. Given two sequences X and Y of lengths n and m, respectively, and the constrained sequence P of length r, previous research shows that the CLCS problem can be solved by either an 0(nmr)-time algorithm based upon dynamic programming (DP) techniques or an 0(r R log log(n + m))-time Hunt-Szymanski-like algorithm, where,R is the total r umber of ordered pairs of positions at which the two strings match. In this paper, we investigate the case that X, Y and P are all in run-length encoded (RLE) format, where the numbers of runs are N, M and R, respectively. We first show that when the sequences are encoded, the CLCS problem can be solved by a simple algorithm in 0(nmR + nMr + Nmr) time without decompressing the sequences. Then, we propose a more efficient algorithm with 0(NMr + r x min{q(1), q(2)} + q(3)) time, where q(1) and q(2) denote the numbers of elements in the south and east faces of the matched blocks on the first layer, respectively, and q(3) denotes the number of face elements of all fully matched cuboids in the DP lattice. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 25 条
  • [1] A fast and simple algorithm for computing the longest common subsequence of run-length encoded strings
    Ann, Hsing-Yen
    Yang, Chang-Biau
    Tseng, Chiou-Ting
    Hor, Chiou-Yi
    [J]. INFORMATION PROCESSING LETTERS, 2008, 108 (06) : 360 - 364
  • [2] Matching for run-length encoded strings
    Apostolico, A
    Landau, GM
    Skiena, S
    [J]. JOURNAL OF COMPLEXITY, 1999, 15 (01) : 4 - 16
  • [3] Regular expression constrained sequence alignment
    Arslan, Abdullah N.
    [J]. JOURNAL OF DISCRETE ALGORITHMS, 2007, 5 (04) : 647 - 661
  • [4] Algorithms for the constrained longest common subsequence problems
    Arslan, AN
    Egecioglu, Ö
    [J]. INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2005, 16 (06) : 1099 - 1109
  • [5] The LCA problem revisited
    Bender, MA
    Farach-Colton, M
    [J]. LATIN 2000: THEORETICAL INFORMATICS, 2000, 1776 : 88 - 94
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] Beam search for the longest common subsequence problem
    Blum, Christian
    Blesa, Maria J.
    Lopez-Ibanez, Manuel
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (12) : 3178 - 3186
  • [8] AN IMPROVED ALGORITHM FOR COMPUTING THE EDIT DISTANCE OF RUN-LENGTH CODED STRINGS
    BUNKE, H
    CSIRIK, J
    [J]. INFORMATION PROCESSING LETTERS, 1995, 54 (02) : 93 - 96
  • [9] On the generalized constrained longest common subsequence problems
    Chen, Yi-Ching
    Chao, Kun-Mao
    [J]. JOURNAL OF COMBINATORIAL OPTIMIZATION, 2011, 21 (03) : 383 - 392
  • [10] A simple algorithm for the constrained sequence problems
    Chin, FYL
    De Santis, A
    Ferrara, AL
    Ho, NL
    Kim, SK
    [J]. INFORMATION PROCESSING LETTERS, 2004, 90 (04) : 175 - 179