Efficient DDPG via the Self-Supervised Method

Cited by: 0
Authors
Zhang, Guanghao [1]
Chen, Hongliang [2]
Li, Jianxun [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] AVIC, Inst Electroopt Equipment, Luoyang 4710009, Peoples R China
Source
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020) | 2020
Keywords
Efficient DDPG; Self-Supervised Method; Inverse and Forward Model; MODEL;
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper investigates an efficient DDPG (Deep Deterministic Policy Gradient) reinforcement learning algorithm with an embedded self-supervised learning network. To capture more of the essential characteristics of the observed data, the inputs of the DDPG actor and critic networks are replaced by the high-dimensional outputs of the feature-extraction layers and the forward network, respectively. The parameters of these auxiliary layers are optimized in a self-supervised manner by minimizing prediction errors, so the two optimization processes can run in parallel. Finally, an adversarial air-combat simulation with a novel customized training index is introduced to demonstrate the effectiveness and improved efficiency of the self-supervised DDPG algorithm.
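The data flow described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the single-layer linear models, the layer sizes (`OBS_DIM`, `ACT_DIM`, `FEAT_DIM`), the random data, and all function names are assumptions made for illustration only. It shows the forward pass and the self-supervised loss (forward-model plus inverse-model prediction error); the gradient updates of DDPG and of the auxiliary layers are omitted.

```python
import numpy as np

rng = np.random.default_rng(0)
OBS_DIM, ACT_DIM, FEAT_DIM = 8, 2, 16  # hypothetical sizes, not from the paper


def init(n_in, n_out):
    """Small random weight matrix for a single linear layer."""
    return rng.normal(0.0, 0.1, size=(n_in, n_out))


params = {
    "enc": init(OBS_DIM, FEAT_DIM),             # feature-extraction layer
    "inv": init(2 * FEAT_DIM, ACT_DIM),         # inverse model: (phi, phi') -> action
    "fwd": init(FEAT_DIM + ACT_DIM, FEAT_DIM),  # forward model: (phi, action) -> phi'
    "actor": init(FEAT_DIM, ACT_DIM),           # actor reads encoder features
    "critic": init(FEAT_DIM, 1),                # critic reads forward-model output
}


def encode(p, obs):
    """Map raw observations to features."""
    return np.tanh(obs @ p["enc"])


def forward_model(p, feat, act):
    """Predict next-step features from current features and action."""
    return np.tanh(np.concatenate([feat, act], axis=-1) @ p["fwd"])


def inverse_model(p, feat, next_feat):
    """Predict the action that led from feat to next_feat."""
    return np.tanh(np.concatenate([feat, next_feat], axis=-1) @ p["inv"])


def self_supervised_loss(p, obs, act, next_obs):
    """Sum of forward and inverse mean-squared prediction errors."""
    feat, next_feat = encode(p, obs), encode(p, next_obs)
    fwd_err = np.mean((forward_model(p, feat, act) - next_feat) ** 2)
    inv_err = np.mean((inverse_model(p, feat, next_feat) - act) ** 2)
    return fwd_err + inv_err


def actor(p, obs):
    """Deterministic policy acting on encoder features, bounded to [-1, 1]."""
    return np.tanh(encode(p, obs) @ p["actor"])


def critic(p, obs, act):
    """Q-value estimate computed from the forward model's output."""
    return forward_model(p, encode(p, obs), act) @ p["critic"]


batch = 4
obs = rng.normal(size=(batch, OBS_DIM))
next_obs = rng.normal(size=(batch, OBS_DIM))  # stand-in for environment transitions
act = actor(params, obs)
q = critic(params, obs, act)
loss = self_supervised_loss(params, obs, act, next_obs)
```

In a full training loop, the auxiliary loss would be minimized with its own optimizer alongside the usual DDPG actor and critic updates, which is one way to read the abstract's statement that the two optimizations run in parallel.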
Pages: 4636-4642
Page count: 7