共 14 条
[1]
A policy gradient algorithm integrating long and short-term rewards for soft continuum arm control.[J].DONG Xiang;ZHANG Jing;CHENG Long;XU WenJun;SU Hang;MEI Tao;.Science China(Technological Sciences).2022, 10
[2]
Hawk and pigeon's intelligence for UAV swarm dynamic combat game via competitive learning pigeon-inspired optimization.[J].YU YuePing;LIU JiChuan;WEI Chen;.Science China(Technological Sciences).2022, 05
[3]
Formation control of quad-rotor UAV via PIO.[J].BAI TingTing;WANG DaoBo;MASOOD Rana Javed;.Science China(Technological Sciences).2022, 02
[4]
Convolution without multiplication: A general speed up strategy for CNNs.[J].CAI GuoRong;YANG ShengMing;DU Jing;WANG ZongYue;HUANG Bin;GUAN Yin;SU SongJian;SU JinHe;SU SongZhi;.Science China(Technological Sciences).2021, 12
[5]
Robust control of uncertain robotic systems:An adaptive friction compensation approach.[J].WANG QiShao;ZHUANG Han;DUAN ZhiSheng;WANG QingYun;.Science China(Technological Sciences).2021, 06
[9]
Actor-Critic Reinforcement Learning for Control with Stability Guarantee.[J].Minghao Han;Lixian Zhang;Jun Wang;Wei Pan.IEEE Robotics and Automation Letters.2020, 99
[10]
Wasserstein Robust Reinforcement Learning..[J].Mohammed Amin Abdullah;Hang Ren;Haitham Bou-Ammar;Vladimir Milenkovic;Rui Luo;Mingtian Zhang;Jun Wang 0012.CoRR.2019,