Deep Reinforcement Learning: An Overview

被引：242

作者：

Mousavi, Seyed Sajad ^{[1
]}

Schukat, Michael ^{[1
]}

Howley, Enda ^{[1
]}

机构：

[1] Natl Univ Ireland, Coll Engn & Informat, Galway, Ireland

来源：

PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2 | 2018年 / 16卷

关键词：

Reinforcement learning; Deep leaning; Neural networks; MDPs; Observable MDPs; REPRESENTATIONS; NETWORK;

D O I：

10.1007/978-3-319-56991-8_32

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, a specific machine learning method called deep learning has gained huge attraction, as it has obtained astonishing results in broad applications such as pattern recognition, speech recognition, computer vision, and natural language processing. Recent research has also been shown that deep learning techniques can be combined with reinforcement learning methods to learn useful representations for the problems with high dimensional raw data input. This article reviews the recent advances in deep reinforcement learning with focus on the most used deep architectures such as autoencoders, convolutional neural networks and recurrent neural networks which have successfully been come together with the reinforcement learning framework.

引用

页码：426 / 440

页数：15

共 45 条

[1]

[Anonymous], 2015, J. Mach. Learn. Res.

[2]

[Anonymous], 2003, P 2003 IEEE RSJ INT

[3]

[Anonymous], 2010, IEEE IJCNN

[4] Evolution strategies – A comprehensive introduction [J].

Hans-Georg Beyer ;

Hans-Paul Schwefel .

Natural Computing, 2002, 1 (1) :3-52

[5]

Barto AG, 2003, DISCRETE EVENT DYN S, V13, P41, DOI [10.1023/A:1022140919877, 10.1023/A:1025696116075]

[6] The Arcade Learning Environment: An Evaluation Platform for General Agents [J].

Bellemare, Marc G. ;

Naddaf, Yavar ;

Veness, Joel ;

Bowling, Michael .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279

[7] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].

BENGIO, Y ;

SIMARD, P ;

FRASCONI, P .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166

[8]

Bengio Y., 2006, ADV NEURAL INFORM PR, V19

[9] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[10] Learning Deep Architectures for AI [J].

Bengio, Yoshua .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127

← 1 2 3 4 5 →