Safe Neural Network Controller Synthesis and Verification for Hybrid Systems

被引:0
|
作者
Zhao Q.-Y. [1 ]
Wang Y. [1 ]
Li X.-D. [1 ]
机构
[1] State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing
来源
Ruan Jian Xue Bao/Journal of Software | 2023年 / 34卷 / 07期
关键词
barrier certificate; hybrid system; mixed-integer programming; neural network controller; safety control;
D O I
10.13328/j.cnki.jos.006857
中图分类号
学科分类号
摘要
Controller synthesis is a fundamental problem in hybrid system control. The synthesis of safe controllers is related to the use of hybrid systems in safety-critical fields. This paper proposes a novel approach to synthesizing neural network controllers with safety guarantees for hybrid systems. The safety of neural network controllers is guaranteed by barrier certificates, which are simultaneously synthesized with the controllers. To learn safe neural network controllers, first, the network structures of the controllers are determined, and the training datasets are constructed based on the hybrid system. Then, the loss function of network training is encoded based on the barrier certificate conditions guaranteeing the safety of the controllers. When the training process completes, the learned controllers are safe on training datasets but may not be safe on the whole hybrid system. To verify the safety of the learned controllers on the whole system, this paper transforms the certification of safety conditions into a group of mixed-integer programming problems and adopts the numerical optimization solver to get formally guaranteed results. We implement the safe neural network controller synthesis tool SafeNC and evaluate its performance on 8 benchmark systems. SafeNC successfully synthesizes large controllers with up to 6 hidden layers and 1804 neurons. The experimental results show that SafeNC can deal with more complex systems, and is more effective and scalable than the existing methods. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
相关论文
共 70 条
  • [1] Maasoumy M, Pinto A, Sangiovanni-Vincentelli A., Model-based hierarchical optimal control design for HVAC systems, Dynamic Systems and Control Conference, pp. 271-278, (2011)
  • [2] Jin WX, Wang ZR, Yang ZR, Mou SH., Neural certificates for safe control policies, (2020)
  • [3] Taylor A, Singletary A, Yue YS, Ames A., Learning for safety-critical control with control barrier functions, Learning for Dynamics and Control, pp. 708-717, (2020)
  • [4] Psaltis D, Sideris A, Yamamura AA., A multilayered neural network controller, IEEE control systems magazine, 8, 2, pp. 17-21, (1988)
  • [5] Sha M, Chen X, Ji YZ, Zhao QY, Yang ZF, Lin W, Tang EY, Chen QG, Li XD., Synthesizing barrier certificates of neural network controlled continuous systems via approximations, 2021 58th ACM/IEEE Design Automation Conference (DAC), pp. 631-636, (2021)
  • [6] Bastani O., Safe reinforcement learning with nonlinear dynamics via model predictive shielding, 2021 American Control Conference (ACC)2021, pp. 3488-3494
  • [7] Zhao QY, Chen X, Zhang YF, Sha M, Yang ZF, Lin W, Tang EY, Chen QG, Li XD., Synthesizing ReLU neural networks with two hidden layers as barrier certificates for hybrid systems, Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control, pp. 1-11, (2021)
  • [8] Cybenko G., Approximation by superpositions of a sigmoidal function, Mathematics of control, signals and systems, 2, 4, pp. 303-314, (1989)
  • [9] Hornik K, Stinchcombe M, White H., Multilayer feedforward networks are universal approximators, Neural networks, 2, 5, pp. 359-366, (1989)
  • [10] Liu J, Zhan NJ, Zhao HJ., Computing semi-algebraic invariants for polynomial dynamical systems, Proceedings of the ninth ACM international conference on Embedded software, pp. 97-106, (2011)