Generative Model-Based Testing on Decision-Making Policies

被引:4
|
作者
Li, Zhuo [1 ]
Wu, Xiongfei [1 ]
Zhu, Derui [2 ]
Cheng, Mingfei [3 ]
Chen, Siyuan [1 ]
Zhang, Fuyuan [1 ]
Xie, Xiaofei [3 ]
Ma, Lei [4 ,5 ]
Zhao, Jianjun [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
来源
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE | 2023年
基金
新加坡国家研究基金会; 加拿大自然科学与工程研究理事会;
关键词
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
D O I
10.1109/ASE56229.2023.00153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reliability of decision-making policies is urgently important today as they have established the fundamentals of many critical applications, such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, due to the deep neural network (DNN)-based inherit and infinite state space, developing scalable and effective testing frameworks for decision-making policies still remains open and challenging. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. Then, we propose a termination state novelty-based guidance to diversify agent behaviors and improve the test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases compared to current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving better robustness enhancement compared to the baseline method.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 50 条
  • [31] Risk, reward, and decision-making in a rodent model of cognitive aging
    Gilbert, Ryan J.
    Mitchell, Marci R.
    Simon, Nicholas W.
    Banuelos, Cristina
    Setlow, Barry
    Bizon, Jennifer L.
    FRONTIERS IN NEUROSCIENCE, 2012, 6
  • [32] IT Governance, Decision-Making and IT Capabilities
    Hiekkanen, Kari
    Korhonen, Janne
    Patricio, Elisabete
    Helenius, Mika
    Collin, Jari
    PROCEEDINGS OF THE 9TH EUROPEAN CONFERENCE ON MANAGEMENT LEADERSHIP AND GOVERNANCE, 2013, : 92 - 99
  • [33] Design of Decision-Making Organizations
    Christensen, Michael
    Knudsen, Thorbjorn
    MANAGEMENT SCIENCE, 2010, 56 (01) : 71 - 89
  • [34] The structure of development decision-making
    Brugha, CM
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 104 (01) : 77 - 92
  • [35] The structure of adjustment decision-making
    Brugha, CM
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 104 (01) : 63 - 76
  • [36] Developing a model for identifying successful petrochemical projects based on Multiple Criteria Decision-Making approach
    Toloie-Eshlaghy, Abbas
    Homayonfar, Mandi
    Motadel, Mohammadreza
    Afshar-Kazemi, Mohammadali
    INNOVATION, MANAGEMENT AND SERVICE, ICMS 2011, 2011, 14 : 243 - 248
  • [37] Risk Analysis and Utility Function-Based Decision-Making Model for Spinning Reserve Allocations
    Ye, Lun
    Yao, Jiangang
    Ouyang, Xu
    Zhu, Xiangqian
    Yang, Shengjie
    IEEE ACCESS, 2021, 9 : 18752 - 18761
  • [38] The structure of qualitative decision-making
    Brugha, CM
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 104 (01) : 46 - 62
  • [39] Decision support for participatory wetland decision-making
    Goosen, Hasse
    Janssen, Ron
    Vermaat, Jan E.
    ECOLOGICAL ENGINEERING, 2007, 30 (02) : 187 - 199
  • [40] ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters
    Yao, Heyuan
    Song, Zhenhua
    Chen, Baoquan
    Liu, Libin
    ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (06):