Testing of Deep Reinforcement Learning Agents with Surrogate Models

Cited by: 5
Authors
Biagiola, Matteo [1]
Tonella, Paolo [1]
Affiliations
[1] Univ Svizzera Italiana, 6900 Via Buffi 13, Lugano, Switzerland
Funding
European Union Horizon 2020
Keywords
Software testing; reinforcement learning
DOI
10.1145/3631970
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Deep Reinforcement Learning (DRL) has received a lot of attention from the research community in recent years. As the technology moves away from game playing to practical contexts, such as autonomous vehicles and robotics, it is crucial to evaluate the quality of DRL agents. In this article, we propose a search-based approach to test such agents. Our approach, implemented in a tool called Indago, trains a classifier on failure and non-failure environment (i.e., pass) configurations resulting from the DRL training process. The classifier is used at testing time as a surrogate model for the DRL agent execution in the environment, predicting the extent to which a given environment configuration induces a failure of the DRL agent under test. The failure prediction acts as a fitness function, guiding the generation towards failure environment configurations, while saving computation time by deferring the execution of the DRL agent in the environment to those configurations that are more likely to expose failures. Experimental results show that our search-based approach finds 50% more failures of the DRL agent than state-of-the-art techniques. Moreover, such failures are, on average, 78% more diverse; similarly, the behaviors of the DRL agent induced by failure configurations are 74% more diverse.
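The surrogate-guided search described in the abstract can be illustrated with a minimal sketch. This is not Indago's implementation: the logistic-regression classifier, the hill-climbing search, the two-dimensional configuration space, and the failure rule in `toy_agent_fails` are all assumptions chosen to keep the example self-contained. It only shows the general idea of using a classifier's failure prediction as a fitness function and running the (expensive) agent only on promising configurations.

```python
import math
import random


def predict_failure(weights, bias, config):
    """Surrogate output: estimated probability that `config` makes the agent fail."""
    z = bias + sum(w * x for w, x in zip(weights, config))
    z = max(-60.0, min(60.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def train_surrogate(configs, failed, epochs=200, lr=0.5):
    """Fit a tiny logistic-regression classifier with stochastic gradient descent.

    Hypothetical stand-in for the classifier trained on pass/fail
    configurations collected during DRL training.
    """
    weights, bias = [0.0] * len(configs[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(configs, failed):
            err = predict_failure(weights, bias, x) - y
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias


def search_failure_config(weights, bias, dim, steps=200, seed=0):
    """Hill climbing that uses the surrogate's prediction as the fitness function."""
    rng = random.Random(seed)
    best = [rng.random() for _ in range(dim)]
    best_fit = predict_failure(weights, bias, best)
    for _ in range(steps):
        # Mutate the current best configuration, keeping values in [0, 1].
        cand = [min(1.0, max(0.0, v + rng.gauss(0.0, 0.1))) for v in best]
        fit = predict_failure(weights, bias, cand)
        if fit > best_fit:
            best, best_fit = cand, fit
    return best, best_fit


def toy_agent_fails(config):
    """Assumed failure rule standing in for a real agent execution."""
    return config[0] + config[1] > 1.2


# Pass/fail data that, in the real setting, would come from DRL training.
data_rng = random.Random(42)
train_configs = [[data_rng.random(), data_rng.random()] for _ in range(200)]
train_labels = [1.0 if toy_agent_fails(c) else 0.0 for c in train_configs]

weights, bias = train_surrogate(train_configs, train_labels)
candidate, predicted = search_failure_config(weights, bias, dim=2)

# Only configurations the surrogate considers likely failures are executed.
confirmed = toy_agent_fails(candidate) if predicted > 0.5 else None
```

In this sketch the costly step (`toy_agent_fails`, standing in for running the agent in its environment) is called once, on the configuration the search ranks highest, which mirrors the deferral of agent execution mentioned in the abstract.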
Pages: 33