Human-Machine Shared Stabilization Control Based on Safe Adaptive Dynamic Programming With Bounded Rationality

被引:5
作者
Tan, Junkai [1 ,2 ,3 ]
Wang, Jingcheng [4 ]
Xue, Shuangsi [1 ,2 ,3 ]
Cao, Hui [1 ,2 ,3 ]
Li, Huan [1 ,2 ,3 ]
Guo, Zihang [1 ,2 ,3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Shaanxi Key Lab Smart Grid, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, State Key Lab Elect Insulat & Power Equipment, Xian, Peoples R China
[4] Power Stn Inst New Energy Tech Supervis, Dept Tech Supervis, Xian, Peoples R China
基金
中国博士后科学基金;
关键词
adaptive dynamic programming; barrier function; bounded rationality; human-machine collaboration; shared control; HUMAN-ROBOT INTERACTION; GAMES; ADAPTATION; DRIVER;
D O I
10.1002/rnc.7931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers the shared control of bounded rational human behavior with cooperative autonomous machines. For the collaboration of humans and machines, it is crucial to ensure the safety of the interactive process due to the involvement of human beings. First, a barrier-function-based state transformation is developed to ensure full state safety constraints. A level-k$$ k $$ thinking framework is exploited to obtain bounded rationality. Every single level-k$$ k $$ control policy is approximated by using adaptive dynamic programming. Inspired by the theory of human behavior modeling, a probabilistic distribution based on Softmax is utilized to model human behavior, which imitates the uncertainty of human intelligence in the cooperative game. Through the construction of a shared control framework, the control inputs of humans and machines are blended to achieve stabilization safely and efficiently. Finally, simulations are implemented to test the effectiveness of the proposed cooperation architecture. The result demonstrates that full-state asymmetric constraints and stabilization are guaranteed in commonly safety-critical situations, and the shared control framework ensures the safety of the overall system when one of the participants is not safety-aware.
引用
收藏
页码:4638 / 4657
页数:20
相关论文
共 55 条
[1]  
Baker C. L., BAYESIAN MODELS HUMA
[2]   Data-driven Koopman operators for model-based shared control of human-machine systems [J].
Broad, Alexander ;
Abraham, Ian ;
Murphey, Todd ;
Argall, Brenna .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (09) :1178-1195
[3]   A cognitive hierarchy model of games [J].
Camerer, CF ;
Ho, TH ;
Chong, JK .
QUARTERLY JOURNAL OF ECONOMICS, 2004, 119 (03) :861-898
[4]   Safe exploration in model-based reinforcement learning using control barrier functions [J].
Cohen, Max H. ;
Belta, Calin .
AUTOMATICA, 2023, 147
[5]   Off-policy algorithm based Hierarchical optimal control for completely unknown dynamic systems [J].
Cui, Xiaohong ;
Chen, Jiayu ;
Wang, Binrui ;
Xu, Suan .
NEUROCOMPUTING, 2022, 488 :669-680
[6]   A policy-blending formalism for shared control [J].
Dragan, Anca D. ;
Srinivasa, Siddhartha S. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (07) :790-805
[7]   Human-Like Decision Making for Autonomous Driving: A Noncooperative Game Theoretic Approach [J].
Hang, Peng ;
Lv, Chen ;
Xing, Yang ;
Huang, Chao ;
Hu, Zhongxu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (04) :2076-2087
[8]   Learning-Based Adaptive Optimal Control for Connected Vehicles in Mixed Traffic: Robustness to Driver Reaction Time [J].
Huang, Mengzhe ;
Jiang, Zhong-Ping ;
Ozbay, Kaan .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) :5267-5277
[9]   Probabilistic Human Intent Recognition for Shared Autonomy in Assistive Robotics [J].
Jain, Siddarth ;
Argall, Brenna .
ACM TRANSACTIONS ON HUMAN-ROBOT INTERACTION, 2020, 9 (01)
[10]   Approximate optimal trajectory tracking for continuous-time nonlinear systems [J].
Kamalapurkar, Rushikesh ;
Dinh, Huyen ;
Bhasin, Shubhendu ;
Dixon, Warren E. .
AUTOMATICA, 2015, 51 :40-48