Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm

被引:13
作者
Li, Ning [1 ,3 ,4 ]
Tang, Jichuan [1 ,2 ]
Li, Zhong-Xian [1 ,3 ,4 ]
Gao, Xiuyu [5 ]
机构
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300350, Peoples R China
[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore, Singapore
[3] Tianjin Univ, Minist Educ, Key Lab Coast Civil Struct Safety, Tianjin, Peoples R China
[4] Tianjin Univ, Key Lab Earthquake Engn Simulat & Seism Resilienc, China Earthquake Adm, Tianjin, Peoples R China
[5] MTS Corp, Eden Prairie, MN USA
基金
国家重点研发计划;
关键词
deep deterministic policy gradient algorithm; hybrid control; real-time hybrid simulation; reinforcement learning; underwater shaking table; ACTUATOR CONTROL; COMPENSATION; PERFORMANCE;
D O I
10.1002/stc.3035
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The tracking performance of an actuation transfer system in a real-time hybrid simulation (RTHS) frequently faces accuracy and robustness challenges under constraints and complicated environments with uncertainties. This study proposes a novel control approach based on the deep deterministic policy gradient algorithm in reinforcement learning (RL) combined with feedforward (FF) compensation, which emphasizes the implementation of shaking table control and substructure RTHS. The proposed method first describes the control plant within the RL environment. Then, the agent is trained offline to develop optimized control policies for interaction with the environment. A series of validation tests were conducted to assess the performance of the proposed method, starting with the dynamic testing of underwater shaking table control and then a virtual RTHS benchmark problem. For complex systems, such as controlling the underwater shaking table, the proposed algorithm, FF, and adaptive time series (ATS) compensation methods are compared under various water depths and motions. The results show better performance and wider broadband frequency applicability under different shaking table dynamic-coupling effects. Next, a controller based on the proposed method was designed by extending the virtual RTHS via the configuration of the control plant and substructure division, as provided in the RTHS benchmark problem. The proposed RL controller also improved the tracking accuracy and robustness of conventional FF compensators against unmodeled dynamics and perturbation uncertainties. This controller can be extended to further advanced control strategies as a component of model-based control methods.
引用
收藏
页数:24
相关论文
共 43 条
[31]  
Silver D, 2014, PR MACH LEARN RES, V32
[32]   Towards Data-Driven Real-Time Hybrid Simulation: Adaptive Modeling of Control Plants [J].
Simpson, Thomas ;
Dertimanis, Vasilis K. ;
Chatzi, Eleni N. .
FRONTIERS IN BUILT ENVIRONMENT, 2020, 6
[33]   Conceptual Study of a Real-Time Hybrid Simulation Framework for Monopile Offshore Wind Turbines Under Wind and Wave Loads [J].
Song, Wei ;
Sun, Chao ;
Zuo, Yanhui ;
Jahangiri, Vahid ;
Lu, Yan ;
Han, Qinghua .
FRONTIERS IN BUILT ENVIRONMENT, 2020, 6
[34]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[35]   Performance extension of shaking table-based real-time dynamic hybrid testing through full state control via simulation [J].
Tang, Zhenyun ;
Dietz, Matt ;
Hong, Yue ;
Li, Zhenbao .
STRUCTURAL CONTROL & HEALTH MONITORING, 2020, 27 (10)
[36]   A study on a benchmark control problem for real-time hybrid simulation with a tracking error-based adaptive compensator combined with a supplementary proportional-integral-derivative controller [J].
Tao, Junjie ;
Mercan, Oya .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 134
[37]   Hybrid Simulation Method for a Structure Subjected to Fire and Its Application to a Steel Frame [J].
Wang, Xuguang ;
Kim, Robin E. ;
Kwon, Oh-Sung ;
Yeo, Inhwan .
JOURNAL OF STRUCTURAL ENGINEERING, 2018, 144 (08)
[38]   High performance compensation using an adaptive strategy for real-time hybrid simulation [J].
Wang, Zhen ;
Ning, Xizhan ;
Xu, Guoshan ;
Zhou, Huimeng ;
Wu, Bin .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 133
[39]   Real-Time Aerodynamics Hybrid Simulation: A Novel Wind-Tunnel Model for Flexible Bridges [J].
Wu, Teng ;
Li, Shaopeng ;
Sivaselvan, Mettupalayam .
JOURNAL OF ENGINEERING MECHANICS, 2019, 145 (09)
[40]   Reinforcement learning based optimal control of batch processes using Monte-Carlo deep deterministic policy gradient with phase segmentation [J].
Yoo, Haeun ;
Kim, Boeun ;
Kim, Jong Woo ;
Lee, Jay H. .
COMPUTERS & CHEMICAL ENGINEERING, 2021, 144