On the utility of dreaming: A general model for how learning in artificial agents can benefit from data hallucination

被引:4
作者
Windridge, David [1 ,2 ]
Svensson, Henrik [3 ]
Thill, Serge [3 ,4 ]
机构
[1] Middlesex Univ, Dept Comp Sci, London NW4 4BT, England
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Surrey, England
[3] Univ Skovde, Interact Lab, Skovde, Sweden
[4] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, Nijmegen, Netherlands
基金
欧盟地平线“2020”;
关键词
Artificial dream mechanisms; data simulation; machine learning; reinforcement learning; SIMULATION THEORY;
D O I
10.1177/1059712319896489
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the benefits of dream mechanisms - that is, the ability to simulate new experiences based on past ones - in a machine learning context. Specifically, we are interested in learning for artificial agents that act in the world, and operationalize "dreaming" as a mechanism by which such an agent can use its own model of the learning environment to generate new hypotheses and training data. We first show that it is not necessarily a given that such a data-hallucination process is useful, since it can easily lead to a training set dominated by spurious imagined data until an ill-defined convergence point is reached. We then analyse a notably successful implementation of a machine learning-based dreaming mechanism by Ha and Schmidhuber (Ha, D., & Schmidhuber, J. (2018). World models. arXiv e-prints, arXiv:1803.10122). On that basis, we then develop a general framework by which an agent can generate simulated data to learn from in a manner that is beneficial to the agent. This, we argue, then forms a general method for an operationalized dream-like mechanism. We finish by demonstrating the general conditions under which such mechanisms can be useful in machine learning, wherein the implicit simulator inference and extrapolation involved in dreaming act without reinforcing inference error even when inference is incomplete.
引用
收藏
页码:267 / 280
页数:14
相关论文
共 38 条