A Generalize Hardware Debugging Approach for Large Language Models Semi-Synthetic, Datasets

Cited by: 0

Authors:

Fu, Weimin [1]

Li, Shijie [2]

Zhao, Yifang [2]

Yang, Kaichen [3]

Zhang, Xuan [4]

Jin, Yier [2]

Guo, Xiaolong [1]

Affiliations:

[1] Kansas State Univ, Mike Wiegers Dept Elect & Comp Engn, Manhattan, KS 66506 USA

[2] Univ Sci & Technol China, Sch Cyber Sci & Technol, Hefei 230026, Anhui, Peoples R China

[3] Michigan Technol Univ, Dept Elect & Comp Engn, Houghton, MI 49931 USA

[4] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2025, Vol. 72, No. 02

Funding:

U.S. National Science Foundation;

Keywords:

Hardware; Codes; Training; Software; Large language models; Chatbots; Debugging; Synthetic data; Open source hardware; Computer bugs; Large language model; artificial intelligence; hardware debug; version control; electronic design automation; ENERGY;

DOI:

10.1109/TCSI.2024.3487486

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Large Language Models (LLMs) have precipitated emerging trends towards intelligent automation. However, integrating LLMs into the hardware debug domain encounters challenges: the datasets for LLMs for hardware are often plagued by a dual dilemma - scarcity and subpar quality. Traditional hardware debug approaches that rely on experienced labor to generate detailed prompts are not cheaply scalable. Similarly, strategies that depend on existing LLMs and randomly generated prompts fail to achieve sufficient reliability. We propose a directed, semi-synthetic data synthesis method that leverages version control information and journalistic event descriptions. To produce high-quality data, this approach utilizes version control data from hardware projects combined with the 5W1H (Who, What, When, Where, Why, How) journalistic principles. It facilitates the linear scaling of dataset volumes without depending on skilled labor. We have implemented this method on a collected dataset of open-source hardware designs and fine-tuned fifteen general-purpose LLMs to enable their capability in hardware debugging tasks, thereby validating the efficacy of our approach.


Pages: 623-636

Page count: 14
