On the utility of dreaming: A general model for how learning in artificial agents can benefit from data hallucination

被引：4

作者：

Windridge, David ^{[1
,2
]}

Svensson, Henrik ^{[3
]}

Thill, Serge ^{[3
,4
]}

机构：

[1] Middlesex Univ, Dept Comp Sci, London NW4 4BT, England

[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Surrey, England

[3] Univ Skovde, Interact Lab, Skovde, Sweden

[4] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, Nijmegen, Netherlands

来源：

ADAPTIVE BEHAVIOR | 2021年 / 29卷 / 03期

基金：

欧盟地平线“2020”;

关键词：

Artificial dream mechanisms; data simulation; machine learning; reinforcement learning; SIMULATION THEORY;

D O I：

10.1177/1059712319896489

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the benefits of dream mechanisms - that is, the ability to simulate new experiences based on past ones - in a machine learning context. Specifically, we are interested in learning for artificial agents that act in the world, and operationalize "dreaming" as a mechanism by which such an agent can use its own model of the learning environment to generate new hypotheses and training data. We first show that it is not necessarily a given that such a data-hallucination process is useful, since it can easily lead to a training set dominated by spurious imagined data until an ill-defined convergence point is reached. We then analyse a notably successful implementation of a machine learning-based dreaming mechanism by Ha and Schmidhuber (Ha, D., & Schmidhuber, J. (2018). World models. arXiv e-prints, arXiv:1803.10122). On that basis, we then develop a general framework by which an agent can generate simulated data to learn from in a manner that is beneficial to the agent. This, we argue, then forms a general method for an operationalized dream-like mechanism. We finish by demonstrating the general conditions under which such mechanisms can be useful in machine learning, wherein the implicit simulator inference and extrapolation involved in dreaming act without reinforcing inference error even when inference is incomplete.

引用

页码：267 / 280

页数：14

共 38 条

[1] Computer science - What do robots dream of?
Adami, Christoph
[J]. SCIENCE, 2006, 314 (5802) : 1093 - 1094
[2] [Anonymous], 2018, ARXIV180507813
[3] A hybrid image dataset toward bridging the gap between real and simulation environments for robotics: Annotated desktop objects real and synthetic images dataset: ADORESet
Bayraktar, Ertugrul
Yigit, Cihat Bora
Boyraz, Pinar
[J]. MACHINE VISION AND APPLICATIONS, 2019, 30 (01) : 23 - 40
[4] Bojarski M., 2017, Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car
[5] Borrego J., 2018, ARXIV180709834
[6] Dreaming, adaptation, and consciousness: The social mapping hypothesis
Brereton, DP
[J]. ETHOS, 2000, 28 (03) : 379 - 409
[7] REM-SLEEP AND NEURAL NETS
CRICK, F
MITCHISON, G
[J]. BEHAVIOURAL BRAIN RESEARCH, 1995, 69 (1-2) : 147 - 155
[8] THE FUNCTION OF DREAM SLEEP
CRICK, F
MITCHISON, G
[J]. NATURE, 1983, 304 (5922) : 111 - 114
[9] Foulkes David., 1985, Dreaming: A Cognitive-Psychological Analysis
[10] GAIDON A, 2016, 2016 IEEE C COMP VIS

← 1 2 3 4 →