Pulse Width Allocation and Clock Skew Scheduling: Optimizing Sequential Circuits Based on Pulsed Latches

被引:10
作者
Lee, Hyein [1 ]
Paik, Seungwhun [2 ]
Shin, Youngsoo [2 ]
机构
[1] Samsung Elect, Yongin 449711, Gyeonggi Do, South Korea
[2] Korea Adv Inst Sci & Technol, Dept Elect Engn, Taejon 305701, South Korea
关键词
Clock period; clock skew scheduling; clock tree; pulsed latch; sequential circuit;
D O I
10.1109/TCAD.2010.2041845
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pulsed latches, latches driven by a brief clock pulse, offer the same convenience of timing verification and optimization as flip-flop-based circuits, while retaining the advantages of latches over flip-flops. But a pulsed latch that uses a single pulse width has a lower bound on its clock period, limiting its capacity to deal with higher frequencies or operate at lower V-dd. The limitation still exists even when clock skew scheduling is employed, since the amount of skew that can be assigned and realized is practically limited due to process variation. For the first time, we formulate the problem of allocating pulse widths, out of a small discrete number of predefined widths, and scheduling clock skews, within a predefined upper bound on skew, for optimizing pulsed latch-based sequential circuits. We then present an algorithm called PWCS Optimize (pulse width allocation and clock skew scheduling, PWCS) to solve the problem. The allocated skews are realized through synthesis of local clock trees between pulse generators and latches, and a global clock tree between a clock source and pulse generators. Experiments with 65-nm technology demonstrate that combining a small number of different pulse widths with clock skews of up to 10% of the clock period yield the minimum achievable clock period for many benchmark circuits. The results have an average figure of merit of 0.86, where 1.0 indicates a minimum clock period, and the average reduction in area by 11%. The design flow including PWCS_Optimize, placement and routing, and synthesis of local and global clock trees is presented and assessed with example circuits.
引用
收藏
页码:355 / 366
页数:12
相关论文
共 29 条
  • [1] A 1.3-GHz fifth-generation SPARC64 microprocessor
    Ando, H
    Yoshida, Y
    Inoue, A
    Sugiyama, I
    Asakawa, T
    Morita, K
    Muta, T
    Motokurumada, T
    Okada, S
    Yamashita, H
    Satsukawa, Y
    Konmoto, A
    Yamashita, R
    Sugiyama, H
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2003, 38 (11) : 1896 - 1905
  • [2] [Anonymous], UCBERLM9241
  • [3] Carrig K.M., 2000, IBM Micronews, V6, P12
  • [4] Chiang CC, 2007, INTEGR CIRCUIT SYST, P1, DOI 10.1007/978-1-4020-5188-3
  • [5] CHINNERY D, 2002, CLOSING GAP ASIC CUS, P4
  • [6] An embedded 32-b microprocessor core for low-power and high-performance applications
    Clark, LT
    Hoffman, EJ
    Miller, J
    Biyani, M
    Liao, YY
    Strazdus, S
    Morrow, M
    Velarde, KE
    Yarch, MA
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2001, 36 (11) : 1599 - 1608
  • [7] Retiming revisited and reversed
    Even, G
    Spillinger, IY
    Stok, L
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1996, 15 (03) : 348 - 357
  • [8] CLOCK SKEW OPTIMIZATION
    FISHBURN, JP
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (07) : 945 - 951
  • [9] Held S, 2003, ICCAD-2003: IEEE/ACM DIGEST OF TECHNICAL PAPERS, P232
  • [10] Kohira Y, 2004, PROCEEDINGS OF THE 2004 IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, VOL 1 AND 2, P533