Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm

被引：13

作者：

Li, Ning ^{[1
,3
,4
]}

Tang, Jichuan ^{[1
,2
]}

Li, Zhong-Xian ^{[1
,3
,4
]}

Gao, Xiuyu ^{[5
]}

机构：

[1] Tianjin Univ, Sch Civil Engn, Tianjin 300350, Peoples R China

[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore, Singapore

[3] Tianjin Univ, Minist Educ, Key Lab Coast Civil Struct Safety, Tianjin, Peoples R China

[4] Tianjin Univ, Key Lab Earthquake Engn Simulat & Seism Resilienc, China Earthquake Adm, Tianjin, Peoples R China

[5] MTS Corp, Eden Prairie, MN USA

来源：

STRUCTURAL CONTROL & HEALTH MONITORING | 2022年 / 29卷 / 10期

基金：

国家重点研发计划;

关键词：

deep deterministic policy gradient algorithm; hybrid control; real-time hybrid simulation; reinforcement learning; underwater shaking table; ACTUATOR CONTROL; COMPENSATION; PERFORMANCE;

D O I：

10.1002/stc.3035

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

The tracking performance of an actuation transfer system in a real-time hybrid simulation (RTHS) frequently faces accuracy and robustness challenges under constraints and complicated environments with uncertainties. This study proposes a novel control approach based on the deep deterministic policy gradient algorithm in reinforcement learning (RL) combined with feedforward (FF) compensation, which emphasizes the implementation of shaking table control and substructure RTHS. The proposed method first describes the control plant within the RL environment. Then, the agent is trained offline to develop optimized control policies for interaction with the environment. A series of validation tests were conducted to assess the performance of the proposed method, starting with the dynamic testing of underwater shaking table control and then a virtual RTHS benchmark problem. For complex systems, such as controlling the underwater shaking table, the proposed algorithm, FF, and adaptive time series (ATS) compensation methods are compared under various water depths and motions. The results show better performance and wider broadband frequency applicability under different shaking table dynamic-coupling effects. Next, a controller based on the proposed method was designed by extending the virtual RTHS via the configuration of the control plant and substructure division, as provided in the RTHS benchmark problem. The proposed RL controller also improved the tracking accuracy and robustness of conventional FF compensators against unmodeled dynamics and perturbation uncertainties. This controller can be extended to further advanced control strategies as a component of model-based control methods.

引用

页数：24

共 43 条

[31]

Silver D, 2014, PR MACH LEARN RES, V32

[32] Towards Data-Driven Real-Time Hybrid Simulation: Adaptive Modeling of Control Plants [J].