Integrating Safety Guarantees into the Learning Classifier System XCS

被引:0
作者
Hansmeier, Tim [1 ]
Platzner, Marco [1 ]
机构
[1] Paderborn Univ, Paderborn, Germany
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2022) | 2022年
关键词
Safety; Safe reinforcement learning; LCS; XCS;
D O I
10.1007/978-3-031-02462-7_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
On-line learning mechanisms are frequently employed to implement self-adaptivity in modern systems. With more widespread use in technical systems that interact with their physical environment, e.g. cyber-physical systems, the fulfillment of safety requirements is increasingly gaining attention. We focus on the learning classifier system XCS with its human-interpretable rules and propose an approach to integrate safety guarantees into its rule base. We leverage the interpretability of XCS' rules to internalize the safety-critical knowledge, as opposed to related work, which relies on an external safety monitor. The experimental evaluation shows that such manually injected knowledge not only gives safety guarantees but aids the learning mechanism of XCS. Especially in complex environments where XCS is struggling to find the optimal solution, the use of hand-crafted forbidden classifiers leads to a performance that is up to 41.7 % better than with an external safety monitor.
引用
收藏
页码:386 / 401
页数:16
相关论文
共 50 条
  • [31] Combining accuracy and success-rate to improve the performance of eXtended Classifier System (XCS) for data-mining and control applications
    Panahi, M. Shariat
    Yousefi, A. Karkhaneh
    Khorshidi, M.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (08) : 1930 - 1935
  • [32] DXCS: an XCS system for distributed data mining
    Dam, Hai H.
    Abbass, Hussein A.
    Lokan, Chris
    [J]. GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1883 - 1890
  • [33] Refined group learning based on XCS and neural network in intelligent financial decision support system
    Li, Jung-Bin
    Chen, An-Pin
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 925 - +
  • [34] A fuzzy system to control exploration rate in XCS
    Hamzeh, Ali
    Rahmani, Adel
    [J]. LEARNING CLASSIFIER SYSTEMS, 2007, 4399 : 115 - 127
  • [35] Resource management and scalability of the XCSF learning classifier system
    Stalph, Patrick O.
    Llora, Xavier
    Goldberg, David E.
    Butz, Martin V.
    [J]. THEORETICAL COMPUTER SCIENCE, 2012, 425 : 126 - 141
  • [36] Deep Reinforcement Learning with a Classifier System - First Steps
    Schoenberner, Connor
    Tomforde, Sven
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2022, 2022, 13642 : 256 - 270
  • [37] Evolution of control with learning classifier systems
    Karlsen M.R.
    Moschoyiannis S.
    [J]. Applied Network Science, 3 (1)
  • [38] Deterministic Safety Guarantees for Learning-Based Control of Monotone Nonlinear Systems Under Uncertainty
    Adamek, Joshua
    Heinlein, Moritz
    Lueken, Lukas
    Lucia, Sergio
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1030 - 1035
  • [39] A Cognitive Architecture Based on a Learning Classifier System with Spiking Classifiers
    Howard, David
    Bull, Larry
    Lanzi, Pier-Luca
    [J]. NEURAL PROCESSING LETTERS, 2016, 44 (01) : 125 - 147
  • [40] To Handle Real Valued Input in XCS: Using Fuzzy Hyper-trapezoidal Membership in Classifier Condition
    Shoeleh, Farzaneh
    Hamzeh, Ali
    Hashemi, Sattar
    [J]. SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 55 - 64