共 50 条
A-learning: A new formulation of associative learning theory
被引:8
|作者:
Ghirlanda, Stefano
[1
,2
,3
]
Lind, Johan
[3
]
Enquist, Magnus
[3
]
机构:
[1] CUNY, Brooklyn Coll, New York, NY 10021 USA
[2] CUNY, Grad Ctr, New York, NY 10021 USA
[3] Stockholm Univ, Stockholm, Sweden
基金:
美国国家科学基金会;
关键词:
Associative learning;
Pavlovian conditioning;
Instrumental conditioning;
Mathematical model;
Conditioned reinforcement;
Outcome revaluation;
UNCONDITIONED STIMULUS;
EXTINCTION;
BEHAVIOR;
REINFORCEMENT;
MODEL;
AUTOMAINTENANCE;
OPERANT;
WATER;
ORGANIZATION;
CONTINGENCY;
D O I:
10.3758/s13423-020-01749-0
中图分类号:
B841 [心理学研究方法];
学科分类号:
040201 ;
摘要:
We present a new mathematical formulation of associative learning focused on non-human animals, which we call A-learning. Building on current animal learning theory and machine learning, A-learning is composed of two learning equations, one for stimulus-response values and one for stimulus values (conditioned reinforcement). A third equation implements decision-making by mapping stimulus-response values to response probabilities. We show that A-learning can reproduce the main features of: instrumental acquisition, including the effects of signaled and unsignaled non-contingent reinforcement; Pavlovian acquisition, including higher-order conditioning, omission training, autoshaping, and differences in form between conditioned and unconditioned responses; acquisition of avoidance responses; acquisition and extinction of instrumental chains and Pavlovian higher-order conditioning; Pavlovian-to-instrumental transfer; Pavlovian and instrumental outcome revaluation effects, including insight into why these effects vary greatly with training procedures and with the proximity of a response to the reinforcer. We discuss the differences between current theory and A-learning, such as its lack of stimulus-stimulus and response-stimulus associations, and compare A-learning with other temporal-difference models from machine learning, such as Q-learning, SARSA, and the actor-critic model. We conclude that A-learning may offer a more convenient view of associative learning than current mathematical models, and point out areas that need further development.
引用
收藏
页码:1166 / 1194
页数:29
相关论文