Generative Model-Based Testing on Decision-Making Policies

Cited by: 4
Authors
Li, Zhuo [1]
Wu, Xiongfei [1]
Zhu, Derui [2]
Cheng, Mingfei [3]
Chen, Siyuan [1]
Zhang, Fuyuan [1]
Xie, Xiaofei [3]
Ma, Lei [4, 5]
Zhao, Jianjun [1]
Affiliations
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
Source
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE | 2023
Funding
National Research Foundation, Singapore; Natural Sciences and Engineering Research Council of Canada;
Keywords
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
DOI
10.1109/ASE56229.2023.00153
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The reliability of decision-making policies is critically important today, as they underpin many safety-critical applications such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, because these policies are inherently based on deep neural networks (DNNs) and operate over infinite state spaces, developing scalable and effective testing frameworks for them remains an open challenge. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. We then propose a termination state novelty-based guidance to diversify agent behaviors and improve test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases than current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving better robustness enhancement than the baseline method.
Pages: 243-254
Number of pages: 12
Related papers
50 in total
  • [41] Workshop on advances in model-based software testing
    Dalal, S
    Jain, A
    Poore, J
    ICSE 05: 27th International Conference on Software Engineering, Proceedings, 2005 : 680 - 680
  • [42] Continuous Action Air Combat Maneuver Decision-Making Based on T-MGMM
    Jiang, Junzhe
    Wang, Hongming
    Huang, Zhixing
    Zhou, Zhuangfeng
    Wu, Xiang
    Deng, Wenqin
    Chen, Xueyun
    IEEE ACCESS, 2024, 12 : 178507 - 178522
  • [43] Targeting goal-based decision-making for addiction recovery
    Verdejo-Garcia, Antonio
    Chong, Trevor T. -J.
    PHARMACOLOGY BIOCHEMISTRY AND BEHAVIOR, 2021, 210
  • [44] Decision-making model for sustainable supply chain finance under uncertainties
    Tseng, Ming-Lang
    Wu, Kuo-Jui
    Hu, Jiayao
    Wang, Chin-Hsin
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2018, 205 : 30 - 36
  • [45] Many objective robust decision-making model for agriculture decisions (MORDMAgro)
    Gonzalez, Xavier Ignacio
    Bert, Federico
    Podesta, Guillermo
    INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2023, 30 (04) : 1617 - 1646
  • [46] Enterprise Management Decision-Making Evaluation Model and Its Empirical Study
    Yan, Xiaojun
    Chen, Zhiya
    PROCEEDINGS OF THE 6TH INTERNATIONAL ASIA CONFERENCE ON INDUSTRIAL ENGINEERING AND MANAGEMENT INNOVATION, VOL 2: INNOVATION AND PRACTICE OF INDUSTRIAL ENGINEERING AND MANAGEMENT, 2016 : 827 - 834
  • [47] Investigating effects of group model building on sustainable design decision-making
    Watz, Matilda
    Johansson, Christian
    Bertoni, Alessandro
    Hallstedt, Sophie I.
    SUSTAINABLE PRODUCTION AND CONSUMPTION, 2022, 33 : 846 - 862
  • [48] Model-Based Decision Support in Manufacturing and Service Networks
    Fink, Andreas
    Kliewer, Natalia
    Mattfeld, Dirk
    Moench, Lars
    Rothlauf, Franz
    Schryen, Guido
    Suhl, Leena
    Voss, Stefan
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2014, 6 (01) : 17 - 24
  • [49] Autonomy and social influence in predictive genetic testing decision-making: A qualitative interview study
    Zimmermann, Bettina M.
    Kone, Insa
    Shaw, David
    Elger, Bernice
    BIOETHICS, 2021, 35 (02) : 199 - 206
  • [50] Model-Based and Graph-Based Priors for Group Testing
    Lau, Ivan
    Scarlett, Jonathan
    Sun, Yang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 6035 - 6050