Generative Model-Based Testing on Decision-Making Policies

Cited by: 4
Authors
Li, Zhuo [1]
Wu, Xiongfei [1]
Zhu, Derui [2]
Cheng, Mingfei [3]
Chen, Siyuan [1]
Zhang, Fuyuan [1]
Xie, Xiaofei [3]
Ma, Lei [4, 5]
Zhao, Jianjun [1]
Affiliations
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
Source
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE | 2023
Funding
National Research Foundation, Singapore; Natural Sciences and Engineering Research Council of Canada
Keywords
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
DOI
10.1109/ASE56229.2023.00153
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The reliability of decision-making policies is critically important today, as they form the foundation of many critical applications, such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, because these policies are inherently deep neural network (DNN)-based and operate over infinite state spaces, developing scalable and effective testing frameworks for them remains an open challenge. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. We then propose a termination state novelty-based guidance to diversify agent behaviors and improve test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases than current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving greater robustness improvement than the baseline method.
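As a rough illustration of what a termination-state novelty signal can look like, the sketch below uses generic novelty search: a terminal state is scored by its mean distance to its nearest neighbours in an archive of previously seen terminal states. This is not the paper's implementation. The function names, the Euclidean distance, the parameter k, and the threshold are all assumptions made for illustration.

```python
import math


def novelty(state, archive, k=3):
    """Mean Euclidean distance from `state` to its k nearest archive entries.

    Hypothetical helper: `state` is a tuple of floats describing where an
    episode terminated. An empty archive makes every state maximally novel.
    """
    if not archive:
        return math.inf
    dists = sorted(math.dist(state, other) for other in archive)
    nearest = dists[:k]
    return sum(nearest) / len(nearest)


def update_archive(state, archive, threshold=1.0, k=3):
    """Store `state` if its novelty exceeds `threshold` (an assumed value).

    Returns True when the state was added, i.e. when the test case that
    produced it ended somewhere sufficiently different from earlier ones.
    """
    if novelty(state, archive, k) > threshold:
        archive.append(tuple(state))
        return True
    return False
```

A test generator could then favor test cases whose terminal states score high under `novelty`, which pushes the search toward agent behaviors that end in states unlike those already observed.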
Pages: 243-254
Page count: 12