Generative Model-Based Testing on Decision-Making Policies

被引:4
|
作者
Li, Zhuo [1 ]
Wu, Xiongfei [1 ]
Zhu, Derui [2 ]
Cheng, Mingfei [3 ]
Chen, Siyuan [1 ]
Zhang, Fuyuan [1 ]
Xie, Xiaofei [3 ]
Ma, Lei [4 ,5 ]
Zhao, Jianjun [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
来源
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE | 2023年
基金
新加坡国家研究基金会; 加拿大自然科学与工程研究理事会;
关键词
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
D O I
10.1109/ASE56229.2023.00153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reliability of decision-making policies is urgently important today as they have established the fundamentals of many critical applications, such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, due to the deep neural network (DNN)-based inherit and infinite state space, developing scalable and effective testing frameworks for decision-making policies still remains open and challenging. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. Then, we propose a termination state novelty-based guidance to diversify agent behaviors and improve the test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases compared to current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving better robustness enhancement compared to the baseline method.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 50 条
  • [21] Risk-based decision-making support model for offshore dynamic positioning operations
    Hogenboom, Sandra
    Vinnem, Jan Erik
    Utne, Ingrid B.
    Kongsvik, Trond
    SAFETY SCIENCE, 2021, 140
  • [22] Survey of Model-Based Security Testing Approaches in the Automotive Domain
    Sommer, Florian
    Kriesten, Reiner
    Kargl, Frank
    IEEE ACCESS, 2023, 11 : 55474 - 55514
  • [23] Cyber Evaluation and Management Toolkit (CEMT): Face Validity of Model-Based Cybersecurity Decision Making
    Fowler, Stuart
    Joiner, Keith
    Ma, Siqi
    SYSTEMS, 2024, 12 (07):
  • [24] Autonomous Air Combat Maneuver Decision-Making Based on PPO-BWDA
    Wang, Hongming
    Zhou, Zhuangfeng
    Jiang, Junzhe
    Deng, Wenqin
    Chen, Xueyun
    IEEE ACCESS, 2024, 12 : 119116 - 119132
  • [25] A Model for Linguistic Dynamic Multi-criteria Decision-Making
    Jiang, Le
    Liu, Hongbin
    Martinez, Luis
    Cai, Jianfeng
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2013), 2014, 277 : 939 - 949
  • [26] Proposal of a hybrid decision-making model for the alignment of the environmental performance
    Boulagouas, Wafa
    Chaib, Rachid
    Djebabra, Mebarek
    MANAGEMENT OF ENVIRONMENTAL QUALITY, 2020, 31 (06) : 1603 - 1622
  • [27] An Integrated Decision-Making Model for the Location of a PV Solar Plant
    Lee, Amy H. I.
    Kang, He-Yau
    Lin, Chun-Yu
    Shen, Kuan-Chin
    SUSTAINABILITY, 2015, 7 (10) : 13522 - 13541
  • [28] [Invited] Generative Model-Based Text-to-Speech Synthesis
    Zen, Heiga
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 327 - 328
  • [29] Supply chain performance: a novel integrated decision-making model
    Zhong, Jianlan
    Cheng, Han
    Gholami, Hamed
    Letchumanan, L. Thiruvarasu
    Toptanci, Sura
    MANAGEMENT DECISION, 2023, 61 (10) : 3053 - 3081
  • [30] Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
    Li, Gen
    Wei, Yuting
    Chi, Yuejie
    Chen, Yuxin
    OPERATIONS RESEARCH, 2024, 72 (01) : 203 - 221