Integrating Safety Guarantees into the Learning Classifier System XCS

被引：0

作者：

Hansmeier, Tim ^{[1
]}

Platzner, Marco ^{[1
]}

机构：

[1] Paderborn Univ, Paderborn, Germany

来源：

APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2022) | 2022年

关键词：

Safety; Safe reinforcement learning; LCS; XCS;

D O I：

10.1007/978-3-031-02462-7_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

On-line learning mechanisms are frequently employed to implement self-adaptivity in modern systems. With more widespread use in technical systems that interact with their physical environment, e.g. cyber-physical systems, the fulfillment of safety requirements is increasingly gaining attention. We focus on the learning classifier system XCS with its human-interpretable rules and propose an approach to integrate safety guarantees into its rule base. We leverage the interpretability of XCS' rules to internalize the safety-critical knowledge, as opposed to related work, which relies on an external safety monitor. The experimental evaluation shows that such manually injected knowledge not only gives safety guarantees but aids the learning mechanism of XCS. Especially in complex environments where XCS is struggling to find the optimal solution, the use of hand-crafted forbidden classifiers leads to a performance that is up to 41.7 % better than with an external safety monitor.

引用

页码：386 / 401

页数：16

共 50 条

[31] Combining accuracy and success-rate to improve the performance of eXtended Classifier System (XCS) for data-mining and control applications
Panahi, M. Shariat
Yousefi, A. Karkhaneh
Khorshidi, M.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (08) : 1930 - 1935
[32] DXCS: an XCS system for distributed data mining
Dam, Hai H.
Abbass, Hussein A.
Lokan, Chris
[J]. GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 1883 - 1890
[33] Refined group learning based on XCS and neural network in intelligent financial decision support system
Li, Jung-Bin
Chen, An-Pin
[J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 925 - +
[34] A fuzzy system to control exploration rate in XCS
Hamzeh, Ali
Rahmani, Adel
[J]. LEARNING CLASSIFIER SYSTEMS, 2007, 4399 : 115 - 127
[35] Resource management and scalability of the XCSF learning classifier system
Stalph, Patrick O.
Llora, Xavier
Goldberg, David E.
Butz, Martin V.
[J]. THEORETICAL COMPUTER SCIENCE, 2012, 425 : 126 - 141
[36] Deep Reinforcement Learning with a Classifier System - First Steps
Schoenberner, Connor
Tomforde, Sven
[J]. ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2022, 2022, 13642 : 256 - 270
[37] Evolution of control with learning classifier systems
Karlsen M.R.
Moschoyiannis S.
[J]. Applied Network Science, 3 (1)
[38] Deterministic Safety Guarantees for Learning-Based Control of Monotone Nonlinear Systems Under Uncertainty
Adamek, Joshua
Heinlein, Moritz
Lueken, Lukas
Lucia, Sergio
[J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1030 - 1035
[39] A Cognitive Architecture Based on a Learning Classifier System with Spiking Classifiers
Howard, David
Bull, Larry
Lanzi, Pier-Luca
[J]. NEURAL PROCESSING LETTERS, 2016, 44 (01) : 125 - 147
[40] To Handle Real Valued Input in XCS: Using Fuzzy Hyper-trapezoidal Membership in Classifier Condition
Shoeleh, Farzaneh
Hamzeh, Ali
Hashemi, Sattar
[J]. SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 55 - 64

← 1 2 3 4 5 →