Integrating Safety Guarantees into the Learning Classifier System XCS

Cited by: 0

Authors:

Hansmeier, Tim [1]

Platzner, Marco [1]

Affiliations:

[1] Paderborn Univ, Paderborn, Germany

Source:

APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2022) | 2022

Keywords:

Safety; Safe reinforcement learning; LCS; XCS

DOI:

10.1007/978-3-031-02462-7_25

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

On-line learning mechanisms are frequently employed to implement self-adaptivity in modern systems. With more widespread use in technical systems that interact with their physical environment, e.g., cyber-physical systems, the fulfillment of safety requirements is increasingly gaining attention. We focus on the learning classifier system XCS with its human-interpretable rules and propose an approach to integrate safety guarantees into its rule base. We leverage the interpretability of XCS' rules to internalize the safety-critical knowledge, as opposed to related work, which relies on an external safety monitor. The experimental evaluation shows that such manually injected knowledge not only gives safety guarantees but also aids the learning mechanism of XCS. Especially in complex environments where XCS is struggling to find the optimal solution, the use of hand-crafted forbidden classifiers leads to a performance that is up to 41.7% better than with an external safety monitor.


Pages: 386-401

Page count: 16
