Projective simulation for artificial intelligence

Cited by: 99

Authors:

Briegel, Hans J. [1,2]

De las Cuevas, Gemma [1,2]

Affiliations:

[1] Univ Innsbruck, Inst Theoret Phys, A-6020 Innsbruck, Austria

[2] Austrian Acad Sci, Inst Quantenopt & Quanteninformat, Innsbruck, Austria

Source:

SCIENTIFIC REPORTS | 2012 / Volume 2

Funding:

Austrian Science Fund;

Keywords:

QUANTUM; SYSTEMS; MEMORY; MAPS;

DOI:

10.1038/srep00400

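Chinese Library Classification:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];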

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

We propose a model of a learning agent whose interaction with the environment is governed by a simulation-based projection, which allows the agent to project itself into future situations before it takes real action. Projective simulation is based on a random walk through a network of clips, which are elementary patches of episodic memory. The network of clips changes dynamically, both due to new perceptual input and due to certain compositional principles of the simulation process. During simulation, the clips are screened for specific features which trigger factual action of the agent. The scheme is different from other, computational, notions of simulation, and it provides a new element in an embodied cognitive science approach to intelligent action and learning. Our model provides a natural route for generalization to quantum-mechanical operation and connects the fields of reinforcement learning and quantum computation.
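The random walk through a clip network described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from this record: the class name `ClipNetwork`, the method names, the use of positive edge weights as unnormalized hopping probabilities, and the additive reward update are all hypothetical choices. The abstract itself does not specify an update rule, and the sketch omits the dynamic creation of new clips.

```python
import random


class ClipNetwork:
    """Toy clip network (hypothetical): clips are nodes, weighted edges guide a random walk."""

    def __init__(self, actions, seed=None):
        self.actions = set(actions)   # clips whose excitation triggers a real action
        self.weights = {}             # weights[src][dst] -> positive edge weight
        self.rng = random.Random(seed)

    def add_edge(self, src, dst, weight=1.0):
        self.weights.setdefault(src, {})[dst] = weight

    def simulate(self, percept, max_steps=100):
        """Walk from a percept clip until an action clip is reached.

        Returns (action, path); action is None if no action clip was reached.
        """
        path = [percept]
        clip = percept
        for _ in range(max_steps):
            if clip in self.actions:
                return clip, path
            successors = self.weights.get(clip)
            if not successors:
                break
            # Hop with probability proportional to the edge weight (assumed rule).
            clip = self.rng.choices(
                list(successors), weights=list(successors.values())
            )[0]
            path.append(clip)
        return (clip if clip in self.actions else None), path

    def reinforce(self, path, reward):
        """Assumed update: add the reward to every edge traversed during the walk."""
        for src, dst in zip(path, path[1:]):
            self.weights[src][dst] += reward


def train(net, percept, rewarded_action, trials=200):
    """Repeat simulate-then-reinforce; reward 1.0 only for the rewarded action."""
    for _ in range(trials):
        action, path = net.simulate(percept)
        net.reinforce(path, 1.0 if action == rewarded_action else 0.0)
```

In this sketch, edges that led to a rewarded action grow heavier, so later walks from the same percept reach that action more often. Whether this matches the paper's actual learning rule cannot be confirmed from the abstract alone.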

References

Pages: 16

47 items in total

