A GPU-Accelerated Framework for Path-Based Timing Analysis

被引:6
|
作者
Guo, Guannan [1 ]
Huang, Tsung-Wei [2 ]
Lin, Yibo [3 ]
Guo, Zizheng [3 ]
Yellapragada, Sushma [1 ]
Wong, Martin D. F. [4 ]
机构
[1] Univ Illinois, Elect & Comp Engn, Champaign, IL 61821 USA
[2] Univ Utah, Elect & Comp Engn, Salt Lake City, UT 84124 USA
[3] Peking Univ, Sch Integrated Circuits, Beijing 100871, Peoples R China
[4] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Graphics processing units; Forestry; Timing; Arrays; Kernel; Throughput; Instruction sets; Graphics processing unit (GPU) acceleration; static timing analysis (STA); PARALLEL;
D O I
10.1109/TCAD.2023.3272274
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As a key routine in static timing analysis (STA), path-based analysis (PBA) plays a very important role in refining the critical path report by reducing excessive slack pessimism. PBA is also well known for its long execution time, which makes it a hot topic for parallel computing in the STA community. However, nearly all of the parallel PBA algorithms are restricted to CPU architectures, which greatly limits their scalability. To achieve a new performance milestone on PBA, we must leverage the high throughput computing in the graphics processing unit (GPU). Therefore, in this work, we propose a new GPU-accelerated PBA framework which contains compact data structures and highly efficient kernels. By integrating with GPU-accelerated preprocessing steps, our framework can also effectively handle extensive critical path constraints. Besides, we highlight many optimization techniques that can overcome the execution bottleneck and further boost the performance. In experiments, we demonstrate 543x speed-up compared to the state-of-the-art PBA algorithm on the design with 1.6 million gates, which outperforms 25x - 45x over the state-of-the-art parallel PBA algorithm on 40 CPU cores. A fully optimized framework can achieve 3x - 5x speed-up on top of that.
引用
收藏
页码:4219 / 4232
页数:14
相关论文
共 50 条
  • [1] GPU-accelerated Path-based Timing Analysis
    Guo, Guannan
    Huang, Tsung-Wei
    Lin, Yibo
    Wong, Martin
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 721 - 726
  • [2] GPU-Accelerated Static Timing Analysis
    Guo, Zizheng
    Huang, Tsung-Wei
    Lin, Yibo
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [3] GPU-accelerated Path Rendering
    Kilgard, Mark J.
    Bolz, Jeff
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06):
  • [4] GPU-accelerated Critical Path Generation with Path Constraints
    Guo, Guannan
    Huang, Tsung-Wei
    Lin, Yibo
    Wong, Martin
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [5] A GPU-Accelerated Framework for Simulating LiDAR Scanning
    Lopez, Alfonso
    Ogayar, Carlos J.
    Jurado, Juan M.
    Feito, Francisco R.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] A GPU-accelerated Framework for Processing Trajectory Queries
    Zhang, Bowen
    Shen, Yanyan
    Zhu, Yanmin
    Yu, Jiadi
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1037 - 1048
  • [7] Fast Path-Based Timing Analysis for CPPR
    Huang, Tsung-Wei
    Wu, Pei-Ci
    Wong, Martin D. F.
    2014 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2014, : 596 - 599
  • [8] GPU-Accelerated Timing Simulation of Systolic-Array-Based AI Accelerators
    Holst, Stefan
    Bumun, Lim
    Wen, Xiaoqing
    2021 IEEE 30TH ASIAN TEST SYMPOSIUM (ATS 2021), 2021, : 127 - 132
  • [9] GPU-Accelerated Adaptive Compression Framework for Genomics Data
    Guo, GuiXin
    Qiu, Shuang
    Ye, ZhiQiang
    Wang, BingQiang
    Fang, Lin
    Lu, Mian
    See, Simon
    Mao, Rui
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [10] GStreamMiner: A GPU-accelerated Data Stream Mining Framework
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, John Jaehwan
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 2489 - 2492