Adaptive optimal consensus of nonlinear multi-agent systems with unknown dynamics using off-policy integral reinforcement learning
被引:0
作者:
Yan, Lei
论文数: 0引用数: 0
h-index: 0
机构:
Nanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R ChinaNanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
Yan, Lei
[1
]
Liu, Zhi
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
Pazhou Lab, Guangzhou 510006, Guangdong, Peoples R ChinaNanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
Liu, Zhi
[2
,4
]
Chen, C. L. Philip
论文数: 0引用数: 0
h-index: 0
机构:
South China Univ Technol, Fac Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R ChinaNanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
Chen, C. L. Philip
[3
]
Zhang, Yun
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R ChinaNanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
Zhang, Yun
[2
]
Wu, Zongze
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R ChinaNanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
Wu, Zongze
[2
]
机构:
[1] Nanyang Inst Technol, Sch Intelligent Mfg, Nanyang 473004, Henan, Peoples R China
[2] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
[3] South China Univ Technol, Fac Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[4] Pazhou Lab, Guangzhou 510006, Guangdong, Peoples R China
Reinforcement learning (RL) has been identified as a promising approach for developing adaptive optimal consensus schemes for high-order strict-feedback nonlinear multi-agent systems (MASs). However, existing methods have limitations, as they can only be applied to systems with partially unknown dynamics and require an identifier-actor-critic framework. This paper proposes a novel approach that combines classical backstepping techniques and off-policy integral reinforcement learning (IRL) to circumvent these limitations and develop an adaptive optimal consensus scheme for nonlinear MASs with completely unknown dynamics. Specifically, we introduce an off-policy IRL-based adaptive optimal consensus scheme that can obtain optimal control inputs without knowledge of the system dynamics. The algorithm utilizes the actor-critic structure and updates the weight vectors using only one learning rule in each step based on the collected system trajectory data. We have proven that the optimal consensus is achieved, and the estimation errors of the optimal weight vectors are uniformly ultimately bounded (UUB). Finally, we present a simulation example to validate the effectiveness of the proposed approach.
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Guangdong Key Lab IoT Informat Technol, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Chen, Ci
Xie, Lihua
论文数: 0引用数: 0
h-index: 0
机构:
Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, SingaporeGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Xie, Lihua
Xie, Kan
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
111 Ctr Intelligent Batch Mfg Based IoT Technol, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Xie, Kan
Lewis, Frank L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, UTA Res Inst, Ft Worth, TX USAGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Lewis, Frank L.
Xie, Shengli
论文数: 0引用数: 0
h-index: 0
机构:
Minist Educ, Key Lab Intelligent Informat Proc & Syst Integrat, Guangzhou, Peoples R China
Guangdong HongKong Macao Joint Lab Smart Discrete, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
Guo Z.
Ren H.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangzhou
Peng Cheng Laboratory, Department of New Networks, ShenzhenGuangdong University of Technology, School of Automation, Guangzhou
Ren H.
Li H.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
Li H.
Zhou Q.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
机构:
Peking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R ChinaPeking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R China
Jia, Yongnan
Wang, Long
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R ChinaPeking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R China
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Jiang, Yi
Gao, Weinan
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Gao, Weinan
Wu, Jin
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Wu, Jin
Chai, Tianyou
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Chai, Tianyou
Lewis, Frank L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 76118 USANortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
机构:
Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
S China Univ Technol, Guangzhou, Guangdong, Peoples R China
Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaUniv Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
Lewis, Frank L.
Vrabie, Draguna
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USAUniv Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Guangdong Key Lab IoT Informat Technol, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Chen, Ci
Xie, Lihua
论文数: 0引用数: 0
h-index: 0
机构:
Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, SingaporeGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Xie, Lihua
Xie, Kan
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
111 Ctr Intelligent Batch Mfg Based IoT Technol, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Xie, Kan
Lewis, Frank L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, UTA Res Inst, Ft Worth, TX USAGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
Lewis, Frank L.
Xie, Shengli
论文数: 0引用数: 0
h-index: 0
机构:
Minist Educ, Key Lab Intelligent Informat Proc & Syst Integrat, Guangzhou, Peoples R China
Guangdong HongKong Macao Joint Lab Smart Discrete, Guangzhou, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
Guo Z.
Ren H.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangzhou
Peng Cheng Laboratory, Department of New Networks, ShenzhenGuangdong University of Technology, School of Automation, Guangzhou
Ren H.
Li H.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
Li H.
Zhou Q.
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong University of Technology, School of Automation, Guangzhou
Guangdong University of Technology, Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, GuangzhouGuangdong University of Technology, School of Automation, Guangzhou
机构:
Peking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R ChinaPeking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R China
Jia, Yongnan
Wang, Long
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R ChinaPeking Univ, Coll Engn, Intelligent Control Lab, Beijing 100871, Peoples R China
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Jiang, Yi
Gao, Weinan
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Gao, Weinan
Wu, Jin
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Wu, Jin
Chai, Tianyou
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R ChinaNortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
Chai, Tianyou
Lewis, Frank L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 76118 USANortheastern Univ, State Key Lab Synthet Automation Proc Ind & Int, Joint Res Lab Integrated Automat, Shenyang 110819, Peoples R China
机构:
Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
S China Univ Technol, Guangzhou, Guangdong, Peoples R China
Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaUniv Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
Lewis, Frank L.
Vrabie, Draguna
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USAUniv Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA