Cache-Oblivious Dynamic Programming for Bioinformatics

被引:18
作者
Chowdhury, Rezaul Alam [1 ]
Le, Hai-Son [2 ]
Ramachandran, Vijaya [3 ]
机构
[1] Univ Texas Austin, Ctr Computat Visualizat, Inst Computat Engn & Sci, Austin, TX 78712 USA
[2] Carnegie Mellon Univ, Sch Comp Sci, Dept Machine Learning, Pittsburgh, PA 15213 USA
[3] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
Sequence alignment; median; RNA secondary structure prediction; dynamic programming; cache-efficient; cache-oblivious; LINEAR-SPACE ALGORITHM; STRUCTURE PREDICTION; COMPLEXITY; SEQUENCES; ALIGNMENT;
D O I
10.1109/TCBB.2008.94
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment ( or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 43 条
  • [1] THE INPUT OUTPUT COMPLEXITY OF SORTING AND RELATED PROBLEMS
    AGGARWAL, A
    VITTER, JS
    [J]. COMMUNICATIONS OF THE ACM, 1988, 31 (09) : 1116 - 1127
  • [2] AHO AV, 1976, J ACM, V23, P1, DOI 10.1145/321921.321922
  • [3] Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots
    Akutsu, T
    [J]. DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) : 45 - 62
  • [4] OPTIMAL SEQUENCE ALIGNMENT USING AFFINE GAP COSTS
    ALTSCHUL, SF
    ERICKSON, BW
    [J]. BULLETIN OF MATHEMATICAL BIOLOGY, 1986, 48 (5-6) : 603 - 616
  • [5] [Anonymous], 2005, Algorithm Design
  • [6] [Anonymous], 1981, P 13 ANN ACM S THEOR, DOI DOI 10.1145/800076.802486
  • [7] [Anonymous], 1995, Introduction to computational biology: maps, sequences and genomes
  • [8] FAST LINEAR-SPACE COMPUTATIONS OF LONGEST COMMON SUBSEQUENCES
    APOSTOLICO, A
    BROWNE, S
    GUERRA, C
    [J]. THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) : 3 - 17
  • [9] A survey of longest common subsequence algorithms
    Bergroth, L
    Hakonen, H
    Raita, T
    [J]. SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, : 39 - 48
  • [10] The Comparative RNA Web (CRW) Site:: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs:: Correction (vol 3, pg 2, 2002) -: art. no. 15
    Cannone, JJ
    Subramanian, S
    Schnare, MN
    Collett, JR
    D'Souza, LM
    Du, YS
    Feng, B
    Lin, N
    Madabusi, LV
    Müller, KM
    Pande, N
    Shang, ZD
    Yu, N
    Gutell, RR
    [J]. BMC BIOINFORMATICS, 2002, 3 (1)