Reinforcement Learning in Few-Shot Scenarios: A Survey

Cited by: 5

Authors:

Wang, Zhechao ^{[1,2]}

Fu, Qiming ^{[1,2]}

Chen, Jianping ^{[2,3]}

Wang, Yunzhe ^{[1,2]}

Lu, You ^{[1,2]}

Wu, Hongjie ^{[1,2]}

Affiliations:

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Suzhou Univ Sci & Technol, Jiangsu Prov Key Lab Intelligent Bldg Energy Effic, Suzhou 215009, Peoples R China

[3] Suzhou Univ Sci & Technol, Sch Architecture & Urban Planning, Suzhou 215009, Peoples R China

Source:

JOURNAL OF GRID COMPUTING | 2023 / Volume 21 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Few-shot learning; Meta-learning; Transfer learning; FRAMEWORK; MODEL;

DOI:

10.1007/s10723-023-09663-0

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Reinforcement learning demands massive amounts of data in complex problems, which makes it infeasible to apply in real cases where sampling is difficult. The key to coping with these few-shot problems is knowledge generalization, and the related algorithms are often called few-shot reinforcement learning (FS-RL). However, a formal definition and comprehensive analyses of few-shot scenarios and FS-RL algorithms are still lacking. Therefore, after giving a uniform definition, we categorize few-shot scenarios into two types. The first type pursues more professional performance, while the other pursues more general performance. In the process of knowledge transfer, few-shot scenarios usually show an obvious tendency toward a certain type of knowledge. Based on this, we divide FS-RL algorithms into two types: the direct transfer case and the indirect transfer case. Thereafter, existing algorithms are discussed under this classification. Finally, we discuss future directions of FS-RL from the perspectives of both theory and application.


Pages: 22

References: 91 in total

