dynamic programming;
bioinformatics;
longest common subsequence;
mosaic sequence;
design of algorithms;
D O I:
10.1016/j.ipl.2006.11.006
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
The longest common subsequence (LCS) problem can be used to measure the relationship between sequences. In general, the inputs of the LCS problem are two sequences. For finding the relationship between one sequence and a set of sequences, we cannot apply the traditional LCS algorithms immediately. In this paper, we define the mosaic LCS (MLCS) problem of finding a mosaic sequence C, composed of repeatable k sequences in source sequence set S, such that the LCS of C and the target sequence T is maximal. Based on the concept of break points in sequence T, we first propose a divide-and-conquer algorithm with O(n(2)m vertical bar S vertical bar + n(3) log k) time for solving this problem, where n and nt are the length of T and the maximal length of sequences in S, respectively. Furthermore, an improved algorithm with O(n (m + k)vertical bar S vertical bar) time is proposed by applying an efficient preprocessing for the MLCS problem. (c) 2006 Elsevier B.V. All rights reserved.
机构:
Int Univ Ho Chi Minh City VNU, Dept Ind Syst Engn, Quarter 6 Linh Trung Ward, Ho Chi Minh City, VietnamUniv Danang, Univ Sci & Technol, Fac Project Management, Danang, 54 Nguyen Luong Bang, Danang, Vietnam
机构:
Int Univ Ho Chi Minh City VNU, Dept Ind Syst Engn, Quarter 6 Linh Trung Ward, Ho Chi Minh City, VietnamUniv Danang, Univ Sci & Technol, Fac Project Management, Danang, 54 Nguyen Luong Bang, Danang, Vietnam
机构:
Univ Technol Ho Chi Minh City VNU, Fac Comp Sci & Engn, 268 Ly Thuong Kiet St Dist 10, Ho Chi Minh City, VietnamUniv Danang, Univ Sci & Technol, Fac Project Management, Danang, 54 Nguyen Luong Bang, Danang, Vietnam