Systematic Generation of Diverse Benchmarks for DNN Verification

Cited by: 3
Authors
Xu, Dong [1 ]
Shriver, David [1 ]
Dwyer, Matthew B. [1 ]
Elbaum, Sebastian [1 ]
Affiliation
[1] Univ Virginia, Charlottesville, VA 22904 USA
Source
COMPUTER AIDED VERIFICATION (CAV 2020), PT I | 2020 / Vol. 12224
Funding
National Science Foundation (USA);
Keywords
Neural network; Verification; Benchmark; Covering array; FORMAL VERIFICATION; TEST SUITES;
DOI
10.1007/978-3-030-53288-8_5
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The field of verification has advanced due to the interplay of theoretical development and empirical evaluation. Benchmarks play an important role in this by supporting the assessment of the state-of-the-art and comparison of alternative verification approaches. Recent years have witnessed significant developments in the verification of deep neural networks, but diverse benchmarks representing the range of verification problems in this domain do not yet exist. This paper describes a neural network verification benchmark generator, GDVB, that systematically varies aspects of problems in the benchmark that influence verifier performance. Through a series of studies, we illustrate how GDVB can assist in advancing the sub-field of neural network verification by more efficiently providing richer and less biased sets of verification problems.
Pages: 97-121
Page count: 25