High-Performance Placement Engine for Modern Large-Scale FPGAs With Heterogeneity and Clock Constraints

被引:2
作者
Zhu, Ziran [1 ]
Mei, Yangjie [1 ]
Deng, Kangkang [2 ]
He, Huan [3 ]
Chen, Jianli [4 ]
Yang, Jun [1 ]
Chang, Yao-Wen [5 ,6 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Res Ctr, Sch Integrated Circuits, Nanjing 210096, Peoples R China
[2] Peking Univ, Beijing Int Ctr Math Res, Beijing 100871, Peoples R China
[3] Hangzhou Huawei Enterprises Telecommun Technol Co, Cent Hardware Planning & Architecture Design Dept, Hangzhou 310000, Peoples R China
[4] Fudan Univ, State Key Lab ASIC & Syst, Shanghai 200433, Peoples R China
[5] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei 10617, Taiwan
[6] Natl Taiwan Univ, Dept Elect Engn, Taipei 10617, Taiwan
关键词
Clock constraint; field-programmable gate array (FPGA); packing; physical design; placement;
D O I
10.1109/TCAD.2023.3329774
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As field-programmable gate array (FPGA) architectures continue to evolve and become more complex, the heterogeneity and clock constraints imposed by modern FPGAs have posed significant challenges to FPGA placement. This article proposes a high-performance placement engine for modern large-scale FPGAs with heterogeneity and clock constraints. To improve efficiency and scalability, we develop a clustering method considering both internal/external connectivity and the balance of block types to build the hierarchy. In each hierarchy level, we propose a hybrid penalty and augmented Lagrangian method (HPALM) to convert the FPGA global placement with heterogeneity and clock constraints into a series of unconstrained optimization subproblems, then use the Adam method to solve each subproblem. In particular, we prove that the HPALM is globally convergent for global placement. Besides, a matching-based IP block legalization is developed to legalize the DSPs and RAMs, and a multistage packing is presented to cluster LUTs and FFs into HCLBs. Finally, we propose a history-based legalization to legalize CLBs in an FPGA, and a simulated-annealing-based detailed placement is presented to reduce the wirelength while maintaining legality. Compared with the state-of-the-art works, experimental results based on the ISPD 2017 contest benchmarks show that the proposed algorithm can achieve the shortest routed wirelength in a reasonable runtime.
引用
收藏
页码:956 / 969
页数:14
相关论文
共 31 条
[1]   Cluster-based logic blocks for FPGAs: Area-efficiency vs. input sharing and size [J].
Betz, V ;
Rose, J .
PROCEEDINGS OF THE IEEE 1997 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 1997, :551-554
[2]  
Betz V., 1997, Field-programmable Logic and Applications. 7th International Workshop, FPL '97. Proceedings, P213
[3]  
Bozorgzadeh E, 2001, PROCEEDINGS OF THE ASP-DAC 2001: ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE 2001, P629, DOI 10.1109/ASPDAC.2001.913379
[4]  
Chan T., 2005, P ISPD, P185
[5]   RippleFPGA: Routability-Driven Simultaneous Packing and Placement for Modern FPGAs [J].
Chen, Gengjie ;
Pui, Chak-Wa ;
Chow, Wing-Kai ;
Lam, Ka-Chun ;
Kuang, Jian ;
Young, Evangeline F. Y. ;
Yu, Bei .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (10) :2022-2035
[6]   Clock-Aware Placement for Large-Scale Heterogeneous FPGAs [J].
Chen, Jianli ;
Lin, Zhifeng ;
Kuo, Yun-Chih ;
Huang, Chau-Chin ;
Chang, Yao-Wen ;
Chen, Shih-Chun ;
Chiang, Chun-Han ;
Kuo, Sy-Yen .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (12) :5042-5055
[7]   NTUplace3: An analytical placer for large-scale mixed-size designs with preplaced blocks and density constraints [J].
Chen, Tung-Chieh ;
Jiang, Zhe-Wei ;
Hsu, Tien-Chang ;
Chen, Hsin-Chen ;
Chang, Yao-Wen .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (07) :1228-1240
[8]  
Chen W. Zhu, 2019, IEEEACM INT CONFCOMP, P1
[9]  
Feng WY, 2012, 2012 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT'12), P8, DOI 10.1109/FPT.2012.6412103
[10]  
Hestenes M. R., 1969, Journal of Optimization Theory and Applications, V4, P303, DOI 10.1007/BF00927673