Exploiting Spatial Architectures for Edit Distance Algorithms

被引:0
作者
Tithi, Jesmin Jahan [1 ]
Crago, Neal C. [2 ]
Emer, Joel S. [2 ,3 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Intel Corp, VSSAD, Santa Clara, CA 95051 USA
[3] MIT, CSAIL, Cambridge, MA 02139 USA
来源
2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS) | 2014年
关键词
ALIGNMENTS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we demonstrate the ability of spatial architectures to significantly improve both runtime performance and energy efficiency on edit distance, a broadly used dynamic programming algorithm. Spatial architectures are an emerging class of application accelerators that consist of a network of many small and efficient processing elements that can be exploited by a large domain of applications. In this paper, we utilize the dataflow characteristics and inherent pipeline parallelism within the edit distance algorithm to develop efficient and scalable implementations on a previously proposed spatial accelerator. We evaluate our edit distance implementations using a cycle-accurate performance and physical design model of a previously proposed triggered instruction-based spatial architecture in order to compare against real performance and power measurements on an x86 processor. We show that when chip area is normalized between the two platforms, it is possible to get more than a 50x runtime performance improvement and over 100x reduction in energy consumption compared to an optimized and vectorized x86 implementation. This dramatic improvement comes from lever-aging the massive parallelism available in spatial architectures and from the dramatic reduction of expensive memory accesses through conversion to relatively inexpensive local communication.
引用
收藏
页码:23 / 34
页数:12
相关论文
共 29 条
  • [1] Blelloch G. E., 1997, SPAA '97. 9th Annual ACM Symposium on Parallel Algorithms and Architectures, P249, DOI 10.1145/258492.258517
  • [2] Cache-Oblivious Dynamic Programming
    Chowdhury, Rezaul Alam
    Ramachandran, Vijaya
    [J]. PROCEEDINGS OF THE SEVENTHEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2006, : 591 - 600
  • [3] Cache-Oblivious Dynamic Programming for Bioinformatics
    Chowdhury, Rezaul Alam
    Le, Hai-Son
    Ramachandran, Vijaya
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (03) : 495 - 510
  • [4] Reconfigurable computing: A survey of systems and software
    Compton, K
    Hauck, S
    [J]. ACM COMPUTING SURVEYS, 2002, 34 (02) : 171 - 210
  • [5] Dydel S, 2004, LECT NOTES COMPUT SC, V3203, P23
  • [6] Asim: A performance model framework
    Emer, J
    Ahuja, P
    Borch, E
    Klauser, A
    Luk, CK
    Manne, S
    Mukherjee, SS
    Patil, H
    Wallace, S
    Binkert, N
    Espasa, R
    Juan, T
    [J]. COMPUTER, 2002, 35 (02) : 68 - +
  • [7] Farivar Reza., 2012, Innovative Parallel Computing (InPar), 2012, P1, DOI DOI 10.1109/INPAR.2012.6339593
  • [8] Govindaraju V., 2011, P INT S HIGH PERF CO
  • [9] Grice J A, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P145
  • [10] Garp: A MIPS processor with a reconfigurable coprocessor
    Hauser, JR
    Wawrzynek, J
    [J]. 5TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, 1997, : 12 - 21