Fast algorithms for finding disjoint subsequences with extremal densities

被引:3
作者
Bergkvist, Anders
Damaschke, Peter [1 ]
机构
[1] Chalmers Univ Technol, Sch Engn & Comp Sci, S-41296 Gothenburg, Sweden
[2] Univ Gothenburg, Dept Mol Biol, S-40530 Gothenburg, Sweden
关键词
holes in data; range prediction; protein torsion angle; protein structure prediction; dynamic programming; selection algorithms; time complexity;
D O I
10.1016/j.patcog.2006.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We derive fast algorithms for the following problem: given a set of n points on the real line and two parameters s and p, find s disjoint intervals of maximum total length that contain at most p of the given points. Our main contribution consists of algorithms whose time bounds improve upon a straightforward dynamic programming algorithm, in the relevant case that input size n is much bigger than parameters s and p. These results are achieved by selecting a few candidate intervals that are provably sufficient for building an optimal solution via dynamic programming. As a byproduct of this idea we improve an algorithm for a similar subsequence problem of Chen et al. [Disjoint segments with maximum density, in: International Workshop on Bioinformatics Research and Applications IWBRA 2005, (within ICCS 2005), Lecture Notes in Computer Science, vol. 3515, Springer, Berlin, pp. 845-850]. The problems are motivated by the search for significant patterns in biological data. Finally, we propose several heuristics that further reduce the time complexity in typical instances. One of them leads to an apparently open subsequence sum problem of independent interest. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:2281 / 2292
页数:12
相关论文
共 29 条