A Highly Efficient Substitution Matrix Loader for Pairwise Sequence Alignment

被引:0
作者
Isa, M. Nazrin Md [1 ]
Benkrid, Khaled [1 ]
Clayton, Thomas [1 ]
机构
[1] Univ Edinburgh, Sch Engn, Syst Level Integrat Grp, Edinburgh EH9 3JL, Midlothian, Scotland
来源
2012 24TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS (ICM) | 2012年
关键词
Field Programmable Gate Arrays; Sequence Alignment; Folded Systolic Array; Substitution Matrix; Smith Waterman; Needleman-Wunsch;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a novel substitution matrix loader architecture for pairwise sequence alignment. The search for sequence homology using DP-based alignment matrix computation is an important tool in molecular biology. It can be implemented either by optimal or sub-optimal approaches. Both of these methods require frequent and rapid access to the amino acids probability scores for PE (Processing Element) configuration especially in a folded systolic array. Typical FPGA implementations configure look-up tables in the pipeline PEs either by using a serial configuration chain with different look-up tables or by run time reconfiguration of the same look-up table. In the former case, configuration time increases proportionally to the number of look-up tables, while the latter case suffers from the limited reconfiguration bandwidth. Therefore, in this paper, we propose a highly efficient parallel loader to optimize both time and space complexities of protein sequence alignment in folded systolic arrays, using only two configuration elements (CEs). In addition, the proposed loader enables PEs to be updated with substitution matrix scores concurrently, with the worst case configuration time of 2 x the depth of the PE's look-up table (in clock cycles). This allows for further optimization of the most time consuming alignment matrix computation through efficient scheduling of alignment matrix computation and PE configuration. Implementation results show that the proposed architecture achieves k.N-PE speed-up in configuration time (where k is the folding factor and N-PE is the number of PEs) compared to classical approaches, at virtually no area overhead.
引用
收藏
页数:4
相关论文
共 13 条
  • [1] [Anonymous], 2009, VIRT 5 FAM US GUID
  • [2] [Anonymous], 2011 NASA ESA C AD H
  • [3] A Highly Parameterized and Efficient FPGA-Based Skeleton for Pairwise Biological Sequence Alignment
    Benkrid, Khaled
    Liu, Ying
    Benkrid, AbdSamad
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2009, 17 (04) : 561 - 570
  • [4] Dayhoff M O., 1978, Atlas of Protein Seq Struct, ppp 345
  • [5] Dohi K, 2010, IEEE INT CONF ASAP
  • [6] Durbin R., 1998, Biological sequence analysis: probabilistic models of proteins and nucleic acids
  • [7] AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS
    HENIKOFF, S
    HENIKOFF, JG
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) : 10915 - 10919
  • [8] Liu Y., BMC RES NOTES, V3, P93
  • [9] CUDASW++: Optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
    Liu Y.
    Maskell D.L.
    Schmidt B.
    [J]. BMC Research Notes, 2 (1)
  • [10] Munekawa Y., 2008, BIOINFORMATICS BIOEN