In this paper, we propose a neural-network (NN)-based online off-policy algorithm to optimize a class of nonlinear continuous-time time-delay systems during finite time horizon. The online off-policy algorithm is used to learn the two-stage solution to the time-varying Hamilton-Jacobi-Bellman (HJB) equation without requiring the knowledge of the time-delay system dynamics. The algorithm is implemented by using an actor-critic NN structure with time-varying activation functions. The weights of the two NNs are tuned simultaneously in real-time by considering both the residual error and the terminal error. Two simulation examples demonstrate the applicability of the proposed algorithm. (C) 2017 Elsevier B.V. All rights reserved.
机构:
E China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R ChinaE China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R China
Du, Hongbin
;
Chen, Xiaochuan
论文数: 0引用数: 0
h-index: 0
机构:
Donghua Univ, Mech Engn Coll, Shanghai 201620, Peoples R ChinaE China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R China
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
Gao, Shigen
;
Ning, Bin
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
Ning, Bin
;
Dong, Hairong
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
机构:
E China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R ChinaE China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R China
Du, Hongbin
;
Chen, Xiaochuan
论文数: 0引用数: 0
h-index: 0
机构:
Donghua Univ, Mech Engn Coll, Shanghai 201620, Peoples R ChinaE China Univ Sci & Technol, Dept Automat, Shanghai 200237, Peoples R China
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
Gao, Shigen
;
Ning, Bin
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
Ning, Bin
;
Dong, Hairong
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R ChinaBeijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China