Integrating Safety Guarantees into the Learning Classifier System XCS

被引:0
作者
Hansmeier, Tim [1 ]
Platzner, Marco [1 ]
机构
[1] Paderborn Univ, Paderborn, Germany
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2022) | 2022年
关键词
Safety; Safe reinforcement learning; LCS; XCS;
D O I
10.1007/978-3-031-02462-7_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
On-line learning mechanisms are frequently employed to implement self-adaptivity in modern systems. With more widespread use in technical systems that interact with their physical environment, e.g. cyber-physical systems, the fulfillment of safety requirements is increasingly gaining attention. We focus on the learning classifier system XCS with its human-interpretable rules and propose an approach to integrate safety guarantees into its rule base. We leverage the interpretability of XCS' rules to internalize the safety-critical knowledge, as opposed to related work, which relies on an external safety monitor. The experimental evaluation shows that such manually injected knowledge not only gives safety guarantees but aids the learning mechanism of XCS. Especially in complex environments where XCS is struggling to find the optimal solution, the use of hand-crafted forbidden classifiers leads to a performance that is up to 41.7 % better than with an external safety monitor.
引用
收藏
页码:386 / 401
页数:16
相关论文
共 50 条
  • [21] Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules
    Muhammad Iqbal
    Will N. Browne
    Mengjie Zhang
    Soft Computing, 2015, 19 : 1863 - 1880
  • [22] Evolving optimum populations with XCS classifier systemsXCS with code fragmented action
    Muhammad Iqbal
    Will N. Browne
    Mengjie Zhang
    Soft Computing, 2013, 17 : 503 - 518
  • [23] XCS on Embedded Systems: An Analysis of Execution Profiles and Accelerated Classifier Deletion
    Brede, Mathis
    Hansmeier, Tim
    Platzner, Marco
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 2071 - 2079
  • [24] Learning Classifier System with Deep Autoencoder
    Matsumoto, Kazuma
    Saito, Rei
    Tajima, Yusuke
    Nakata, Masaya
    Sato, Hiroyuki
    Kovacs, Tim
    Takadama, Keiki
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 4739 - 4746
  • [25] Toward a theory of generalization and learning in XCS
    Butz, MV
    Kovacs, T
    Lanzi, PL
    Wilson, SW
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2004, 8 (01) : 28 - 46
  • [26] Reinforcement Learning With Safety and Stability Guarantees During Exploration For Linear Systems
    Marvi, Zahra
    Kiumarsi, Bahare
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2022, 1 : 322 - 334
  • [27] Learning classifier system with average reward reinforcement learning
    Zang, Zhaoxiang
    Li, Dehua
    Wang, Junying
    Xia, Dan
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 58 - 71
  • [28] Towards a More General XCS: Classifier Fusion and Don't Cares in Actions
    Garza-Cuellar, Alejandro
    Valenzuela-Rendon, Manuel
    Parra-Alvarez, Ricardo-Javier
    ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 200 - 210
  • [29] XCS-SL: a rule-based genetic learning system for sequence labeling
    Nakata, Masaya
    Kovacs, Tim
    Takadama, Keiki
    EVOLUTIONARY INTELLIGENCE, 2015, 8 (2-3) : 133 - 148
  • [30] SELF-ADAPTIVE LEARNING CLASSIFIER SYSTEM
    Unold, Olgierd
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2010, 19 (01) : 275 - 296