Influence Function Based Off-policy Q-learning Control for Markov Jump Systems
被引:0
|
作者:
Yuling Zou
论文数: 0引用数: 0
h-index: 0
机构:
Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things EngineeringJiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things Engineering
Yuling Zou
[1
]
Jiwei Wen
论文数: 0引用数: 0
h-index: 0
机构:
Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things EngineeringJiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things Engineering
Jiwei Wen
[1
]
Huiwen Xue
论文数: 0引用数: 0
h-index: 0
机构:
Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things EngineeringJiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things Engineering
Huiwen Xue
[1
]
Xiaoli Luan
论文数: 0引用数: 0
h-index: 0
机构:
Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things EngineeringJiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things Engineering
Xiaoli Luan
[1
]
机构:
[1] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), the School of Internet of Things Engineering
This paper presents an off-policy Q-learning approach based on influence function for addressing H∞ control of Markov jump systems. Unlike existing literatures, the mode classification and parallel update method is developed to directly decouple the relationship among matrices across different modes, tackling the most challenging aspect of this issue. Subsequently, we utilize the off-policy algorithm to derive the optimal policy, which allows for efficient learning without the need to follow the current policy being improved. This approach is particularly advantageous as it enables the algorithm to explore and evaluate different policies from historical data, thus circumventing the limitations associated with specific forms of disturbance updates. Moreover, the influence function is employed for data cleansing during the learning process, thereby enabling a more efficient learning period. A numerical example and a DC motor model are presented to illustrate the validity of the proposed method.
机构:
East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R ChinaEast China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
Xue, Min
Yan, Huaicheng
论文数: 0引用数: 0
h-index: 0
机构:
East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
Chengdu Univ, Sch Informat Sci & Engn, Chengdu 610106, Peoples R ChinaEast China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
Yan, Huaicheng
Zhang, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Tongji Univ, Dept Control Sci & Engn, Shanghai 200092, Peoples R ChinaEast China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
Zhang, Hao
Zhan, Xisheng
论文数: 0引用数: 0
h-index: 0
机构:
Hubei Normal Univ, Coll Mech & Control Engn, Huangshi 435002, Hubei, Peoples R ChinaEast China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
Zhan, Xisheng
Shi, Kaibo
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ, Sch Informat Sci & Engn, Chengdu 610106, Peoples R ChinaEast China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
机构:
Anhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R ChinaAnhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
Tu, Yidong
Fang, Haiyang
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Mech & Automat Engn, Sha Tin, Hong Kong, Peoples R ChinaAnhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
Fang, Haiyang
Wang, Hai
论文数: 0引用数: 0
h-index: 0
机构:
Murdoch Univ, Discipline Engn & Energy, Murdoch, WA, AustraliaAnhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
Wang, Hai
Shi, Kaibo
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ, Sch Elect Informat & Elect Engn, Chengdu, Peoples R ChinaAnhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
Shi, Kaibo
He, Shuping
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R ChinaAnhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
机构:
Anhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R ChinaAnhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Wang, Jing
Peng, Chuanjun
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R ChinaAnhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Peng, Chuanjun
Park, Ju H.
论文数: 0引用数: 0
h-index: 0
机构:
Yeungnam Univ, Dept Elect Engn, Gyongsan 38541, South KoreaAnhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Park, Ju H.
Shen, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R ChinaAnhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
Shen, Hao
Shi, Kaibo
论文数: 0引用数: 0
h-index: 0
机构:
Chengdu Univ, Sch Informat Sci & Engn, Chengdu 610106, Peoples R ChinaAnhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
机构:
Bohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R ChinaBohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R China
Tang, Fanghua
Wang, Huanqing
论文数: 0引用数: 0
h-index: 0
机构:
Bohai Univ, Coll Math Sci, Jinzhou 121013, Liaoning, Peoples R ChinaBohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R China
Wang, Huanqing
Chang, Xiao-Heng
论文数: 0引用数: 0
h-index: 0
机构:
Bohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R ChinaBohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R China
Chang, Xiao-Heng
Zhang, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Bohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R ChinaBohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R China
Zhang, Liang
Alharbi, Khalid H.
论文数: 0引用数: 0
h-index: 0
机构:
King Abdulaziz Univ, Fac Engn, Dept Elect & Comp Engn, Commun Syst & Networks Res Grp, Jeddah, Saudi ArabiaBohai Univ, Coll Control Sci & Engn, Jinzhou 121013, Liaoning, Peoples R China
机构:
Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R ChinaZhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R China
Zhang, Meng
Shi, Peng
论文数: 0引用数: 0
h-index: 0
机构:
Univ Adelaide, Sch Elect & Elect Engn, Adelaide, SA, AustraliaZhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R China
Shi, Peng
Liu, Zhitao
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R ChinaZhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R China
Liu, Zhitao
Cai, Jianping
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Water Resources & Elect Power, Hangzhou, Zhejiang, Peoples R ChinaZhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R China
Cai, Jianping
Su, Hongye
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R ChinaZhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Yuquan Campus, Hangzhou 310027, Zhejiang, Peoples R China
机构:
Qingdao Univ, Sch Automat, Qingdao 266071, Peoples R China
Qingdao Univ, Shandong Key Lab Ind Control Technol, Qingdao 266071, Peoples R ChinaQingdao Univ, Sch Automat, Qingdao 266071, Peoples R China
Zhang, Junye
Liu, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Qingdao Univ, Sch Automat, Qingdao 266071, Peoples R China
Qingdao Univ, Shandong Key Lab Ind Control Technol, Qingdao 266071, Peoples R ChinaQingdao Univ, Sch Automat, Qingdao 266071, Peoples R China
Liu, Zhen
Jiang, Baoping
论文数: 0引用数: 0
h-index: 0
机构:
Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215000, Peoples R ChinaQingdao Univ, Sch Automat, Qingdao 266071, Peoples R China