Leveraging Models to Reduce Test Cases in Software Repositories

Cited by: 3
Authors
Gharachorlu, Golnaz [1]
Sumner, Nick [1]
Affiliations
[1] Simon Fraser Univ, Burnaby, BC, Canada
Source
2021 IEEE/ACM 18TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2021) | 2021
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
test case reduction; semantic validity; machine learning; compilation errors
DOI
10.1109/MSR52588.2021.00035
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Given a failing test case, test case reduction yields a smaller test case that reproduces the failure. This process can be time-consuming due to repeated trial and error with smaller test cases. Current techniques speed up reduction by only exploring syntactically valid candidates, but they still spend significant effort on semantically invalid candidates. In this paper, we propose a model-guided approach to speed up test case reduction. The approach trains a model of semantic properties driven by syntactic test case properties. By using this model, we can skip testing even syntactically valid test case candidates that are unlikely to succeed. We evaluate this model-guided reduction on a suite of 14 large fuzzer-generated C test cases from the bug repositories of two well-known C compilers, GCC and Clang. Our results show that with an average precision of 77%, we can decrease the number of removal trials by 14% to 61%. We observe a 30% geomean improvement in reduction time over the state-of-the-art technique while preserving similar reduction power.
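The idea in the abstract of skipping candidates that a model predicts are unlikely to succeed can be illustrated with a minimal sketch. This record does not describe the paper's actual model, features, or reduction algorithm, so everything below is an assumption for illustration: `reduce_with_model` is a hypothetical greedy one-at-a-time reducer, `likely_valid` is a hand-written stand-in for a trained model, and `still_fails` is a toy stand-in for running the compiler under test.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical oracle: True if the candidate still reproduces the failure.
Test = Callable[[Sequence[str]], bool]
# Hypothetical predictor: True if the candidate looks worth testing.
Predictor = Callable[[Sequence[str]], bool]


def reduce_with_model(
    items: Sequence[str],
    still_fails: Test,
    likely_valid: Predictor,
) -> Tuple[List[str], int, int]:
    """Greedy one-at-a-time removal guided by a predictor.

    Returns (reduced items, trials run, trials skipped).
    """
    current = list(items)
    trials_run = 0
    trials_skipped = 0
    changed = True
    while changed:  # repeat passes until no removal succeeds
        changed = False
        i = 0
        while i < len(current):
            candidate = current[:i] + current[i + 1:]
            if not likely_valid(candidate):
                # The predictor says this candidate is unlikely to
                # succeed, so the expensive test is not run at all.
                trials_skipped += 1
                i += 1
                continue
            trials_run += 1
            if still_fails(candidate):
                current = candidate  # keep the smaller test case
                changed = True
            else:
                i += 1
    return current, trials_run, trials_skipped


# Toy stand-ins (assumptions, not the paper's setup).
def still_fails(candidate: Sequence[str]) -> bool:
    # The "bug" reproduces only if x is declared and crash(x) is called.
    return "int x;" in candidate and "crash(x);" in candidate


def likely_valid(candidate: Sequence[str]) -> bool:
    # Reject candidates that use x without declaring it.
    uses_x = any("x" in s and s != "int x;" for s in candidate)
    return "int x;" in candidate or not uses_x


program = ["int x;", "int y;", "x = 1;", "crash(x);"]
reduced, run, skipped = reduce_with_model(program, still_fails, likely_valid)
print(reduced, run, skipped)
```

In this toy run the predictor avoids testing candidates that drop the declaration of `x` while keeping its uses. A learned model would make such predictions imperfectly, which is why the abstract reports a precision figure alongside the reduction in trials.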
Pages: 230-241
Page count: 12
Related papers
29 in total
  • [1] Chae K, 2017, P ACM PROGRAM LANG, V1, DOI 10.1145/3133925
  • [2] Gharachorlu G., 2019, PARDIS PRIORITY AWAR
  • [3] PARDIS: Priority Aware Test Case Reduction
    Gharachorlu, Golnaz
    Sumner, Nick
    [J]. FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING (FASE 2019), 2019, 11424: 409-426
  • [4] Avoiding the Familiar to Speed Up Test Case Reduction
    Gharachorlu, Golnaz
    Sumner, Nick
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018: 426-437
  • [5] DISTRIBUTIONAL STRUCTURE
    Harris, Zellig S.
    [J]. WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1954, 10 (2-3): 146-162
  • [6] Harwell S., 2020, C 2011 GRAMMAR
  • [7] Continuously Reasoning about Programs using Differential Bayesian Inference
    Heo, Kihong
    Raghothaman, Mukund
    Si, Xujie
    Naik, Mayur
    [J]. PROCEEDINGS OF THE 40TH ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '19), 2019: 561-575
  • [8] Effective Program Debloating via Reinforcement Learning
    Heo, Kihong
    Lee, Woosuk
    Pashakhanloo, Pardis
    Naik, Mayur
    [J]. PROCEEDINGS OF THE 2018 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'18), 2018: 380-394
  • [9] Machine-Learning-Guided Selectively Unsound Static Analysis
    Heo, Kihong
    Oh, Hakjoo
    Yi, Kwangkeun
    [J]. 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2017: 519-529
  • [10] Herfert S, 2017, IEEE INT CONF AUTOM, P861, DOI 10.1109/ASE.2017.8115697