Systematic Generation of Diverse Benchmarks for DNN Verification

Cited by: 3
Authors
Xu, Dong [1]
Shriver, David [1]
Dwyer, Matthew B. [1]
Elbaum, Sebastian [1]
Affiliation
[1] Univ Virginia, Charlottesville, VA 22904 USA
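Source
COMPUTER AIDED VERIFICATION (CAV 2020), PT I | 2020 / Vol. 12224
Funding
National Science Foundation (USA)
Keywords
Neural network; Verification; Benchmark; Covering array; FORMAL VERIFICATION; TEST SUITES
DOI
10.1007/978-3-030-53288-8_5
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract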
The field of verification has advanced due to the interplay of theoretical development and empirical evaluation. Benchmarks play an important role in this by supporting the assessment of the state-of-the-art and comparison of alternative verification approaches. Recent years have witnessed significant developments in the verification of deep neural networks, but diverse benchmarks representing the range of verification problems in this domain do not yet exist. This paper describes a neural network verification benchmark generator, GDVB, that systematically varies aspects of problems in the benchmark that influence verifier performance. Through a series of studies, we illustrate how GDVB can assist in advancing the sub-field of neural network verification by more efficiently providing richer and less biased sets of verification problems.
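The keywords name covering arrays as the device for varying problem aspects systematically. The following is a minimal sketch of that general idea, not GDVB's actual implementation: a greedy pairwise (strength-2) covering array over a few factors. The factor names and levels (`neurons`, `layers`, `epsilon`) and the function `pairwise_covering_array` are hypothetical, chosen only for illustration.

```python
from itertools import combinations, product


def pairwise_covering_array(factors):
    """Greedily pick configurations until every pair of levels drawn
    from two different factors appears together in at least one row."""
    names = list(factors)
    # All (factor index, level) pairs that still need to be covered.
    uncovered = {
        ((i, a), (j, b))
        for i, j in combinations(range(len(names)), 2)
        for a in factors[names[i]]
        for b in factors[names[j]]
    }
    # Exhaustive candidate pool; fine for a sketch, too large for many factors.
    candidates = list(product(*(factors[n] for n in names)))

    def gain(row):
        # Number of still-uncovered pairs this candidate row would cover.
        return sum(
            1 for (i, a), (j, b) in uncovered if row[i] == a and row[j] == b
        )

    rows = []
    while uncovered:
        best = max(candidates, key=gain)
        rows.append(dict(zip(names, best)))
        uncovered = {
            ((i, a), (j, b))
            for (i, a), (j, b) in uncovered
            if not (best[i] == a and best[j] == b)
        }
    return rows


# Hypothetical factors and levels (assumptions, not taken from the paper).
factors = {
    "neurons": [0.25, 0.5, 1.0],   # fraction of a seed network's neurons
    "layers": [2, 4, 6],           # number of hidden layers
    "epsilon": [0.01, 0.05, 0.1],  # radius of the input perturbation
}
benchmark = pairwise_covering_array(factors)
full = len(list(product(*factors.values())))
print(f"{len(benchmark)} configurations cover all pairs; full product has {full}")
```

Each row is one benchmark configuration. The design trade-off is that pairwise coverage exercises every two-factor interaction with far fewer problems than the full Cartesian product, at the cost of leaving some higher-order combinations untested.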
Pages: 97-121
Page count: 25