HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond

被引:0
作者
Abi-Karam, Stefan [1 ,2 ]
Sarkar, Rishov [1 ]
Seigler, Allison [3 ]
Lowe, Sean [4 ]
Wei, Zhigang [3 ]
Chen, Hanqiu [1 ]
Rao, Nanditha [5 ]
John, Lizy [3 ]
Arora, Aman [4 ]
Hao, Cong [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Georgia Tech Res Inst, Atlanta, GA 30332 USA
[3] Univ Texas Austin, Austin, TX USA
[4] Arizona State Univ, Tempe, AZ USA
[5] Int Inst Informat Technol Bangalore, Bangalore, India
来源
2024 ACM/IEEE 6TH SYMPOSIUM ON MACHINE LEARNING FOR CAD, MLCAD 2024 | 2024年
基金
美国国家科学基金会;
关键词
NETWORK;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present great challenges to FPGA and ML researchers. Existing datasets either cover only a subset of previously published benchmarks, provide no way to enumerate optimization design spaces, are limited to a specific vendor, or have no reproducible and extensible software for dataset construction. Many works also lack user-friendly ways to add more designs to existing datasets, limiting wider adoption and sustainability of such datasets. In response to these challenges, we introduce HLSFactory, a comprehensive framework designed to facilitate the curation and generation of high-quality HLS design datasets. HLSFactory has three main stages: 1) a design space expansion stage to elaborate single HLS designs into large design spaces using various optimization directives across multiple vendor tools, 2) a design synthesis stage to execute HLS and FPGA tool flows concurrently across designs, and 3) a data aggregation stage for extracting standardized data into packaged datasets for ML usage. This tripartite architecture not only ensures broad coverage of data points via design space expansion but also supports interoperability with tools from multiple vendors. Users can contribute to each stage easily by submitting their own HLS designs or synthesis results via provided user APIs. The framework is also flexible, allowing extensions at every step via user APIs with custom frontends, synthesis tools, and scripts. To demonstrate the framework functionality, we include an initial set of built-in base designs from PolyBench, MachSuite, Rosetta, CHStone, Kastner et al.s Parallel Programming for FPGAs, and curated kernels from existing open-source HLS designs. We report the statistical analyses and design space visualizations to demonstrate the completed end-to-end compilation flow, and to highlight the effectiveness of our design space expansion beyond the initial base dataset, which greatly contributes to dataset diversity and coverage. In addition to its evident application in ML, we showcase the versatility and multi-functionality of our framework through seven case studies: I) Building an ML model for post-implementation QoR prediction; II) Using design space sampling in stage 1 to expand the design space covered from a small base set of HLS designs; III) Demonstrating the speedup from the fine-grained design parallelism backend; IV) Extending HLSFactory to target Intels HLS flow across all stages; V) Adding and running new auxiliary designs using HLSFactory; VI) Integration of previously published HLS data in stage 3; VII) Using HLSFactory to perform HLS tool version regression benchmarking. Code available at https://github.com/sharc-lab/HLSFactory.
引用
收藏
页数:9
相关论文
共 37 条
[1]  
AMD/Xilinx, 2021, Basic Examples for Vitis HLS
[2]  
AMD/Xilinx, 2022, Vitis Accel Examples' Repository
[3]  
[Anonymous], Polybench
[4]  
Bai Yunsheng, 2023, 37 C NEUR INF PROC S
[5]   DGNN-Booster: A Generic FPGA Accelerator Framework For Dynamic Graph Neural Network Inference [J].
Chen, Hanqiu ;
Hao, Cong .
2023 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM, 2023, :195-201
[6]   Mask-Net: A Hardware-efficient Object Detection Network with Masked Region Proposals [J].
Chen, Hanqiu ;
Hao, Cong .
2022 IEEE 33RD INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2022, :131-138
[7]   MAVIREC: ML-Aided Vectored IR-Drop Estimation and Classification [J].
Chhabria, Vidya A. ;
Zhang, Yanqing ;
Ren, Haoxing ;
Keller, Ben ;
Khailany, Brucek ;
Sapatnekar, Sachin S. .
PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, :1825-1828
[8]  
CliffordWolf Johann, 2013, P 21 AUSTR WORKSH MI
[9]  
Cong Jason, FPGAs for Software Programmers, P137, DOI [10.1007/978-3-319-26408-08, DOI 10.1007/978-3-319-26408-08]
[10]  
Conover W.J., 1999, Practical non-parametric statistics (third edition), V3rd