Efficient DDPG via the Self-Supervised Method

被引：0

作者：

Zhang, Guanghao ^{[1
]}

Chen, Hongliang ^{[2
]}

Li, Jianxun ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China

[2] AVIC, Inst Electroopt Equipment, Luoyang 4710009, Peoples R China

来源：

PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020) | 2020年

关键词：

Efficient DDPG; Self-Supervised Method; Inverse and Forward Model; MODEL;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, embedded with self-supervised learning network, an efficient DDPG(Deep Deterministic Policy Gradient) RL algorithm is investigated. With more essential characteristics of observing data included, the inputs of actor network and critic network of DDPG are replaced by the high-dimensional outputs from feature extracting layers and forward network respectively. Additionally, the parameters of these auxiliary layers are optimized with a self-supervised method by minimizing predicting errors, and thus both optimizing progresses can run parallelly and simultaneously. Lastly, an antagonistic air-fight simulation with a novel customized training index is introduced to perform the effectiveness and rising efficiency of our self-supervised DDPG RL algorithm.

引用

页码：4636 / 4642

页数：7

共 50 条

[31] Consequential Advancements of Self-Supervised Learning (SSL) in Deep Learning Contexts [J].

Abdulrazzaq, Mohammed Majid ;

Ramaha, Nehad T. A. ;

Hameed, Alaa Ali ;

Salman, Mohammad ;

Yon, Dong Keon ;

Fitriyani, Norma Latif ;

Syafrudin, Muhammad ;

Lee, Seung Won .

MATHEMATICS, 2024, 12 (05)

[32] Self-supervised anomaly detection based on foreground enhancement and autoencoder reconstruction [J].

Zhao, Lijie ;

Chai, Yuan ;

Zhang, Qichun ;

Karimi, Hamid Reza .

SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) :343-350

[33] A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion [J].

Huang, Wen-Chin ;

Yang, Shu-Wen ;

Hayashi, Tomoki ;

Toda, Tomoki .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) :1308-1318

[34] Self-Supervised Deep Learning for 3D Gravity Inversion [J].

Li, Yinshuo ;

Jia, Zhuo ;

Lu, Wenkai .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[35] Self-supervised anomaly detection based on foreground enhancement and autoencoder reconstruction [J].

Lijie Zhao ;

Yuan Chai ;

Qichun Zhang ;

Hamid Reza Karimi .

Signal, Image and Video Processing, 2024, 18 (1) :343-350

[36] Self-Supervised SAR Despeckling Powered by Implicit Deep Denoiser Prior [J].

Lin, Huangxing ;

Zhuang, Yihong ;

Huang, Yue ;

Ding, Xinghao .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

[37] Structural representation learning for network alignment with self-supervised anchor links [J].

Thanh Toan Nguyen ;

Minh Tam Pham ;

Thanh Tam Nguyen ;

Thanh Trung Huynh ;

Van Vinh Tong ;

Quoc Viet Hung Nguyen ;

Thanh Tho Quan .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165

[38] A novel self-supervised contrastive learning based sentence-level attribute induction method for online satisfaction evaluation [J].

Zhou, Zhichu ;

Ji, Feixia ;

Chang, Xiaokun ;

Liu, Yujia ;

Fujita, Hamido ;

Wu, Jian .

COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 189

[39] Pa(sic)-HuBERT: SELF-SUPERVISED MUSIC SOURCE SEPARATION VIA PRIMITIVE AUDITORY CLUSTERING AND HIDDEN-UNIT BERT [J].

Chen, Ke ;

Wichern, Gordon ;

Germain, Francois G. ;

Le Roux, Jonathan .

2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,

[40] Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring [J].

Kamper, Herman .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 :684-694

← 1 2 3 4 5 →