A-learning: A new formulation of associative learning theory

被引:8
|
作者
Ghirlanda, Stefano [1 ,2 ,3 ]
Lind, Johan [3 ]
Enquist, Magnus [3 ]
机构
[1] CUNY, Brooklyn Coll, New York, NY 10021 USA
[2] CUNY, Grad Ctr, New York, NY 10021 USA
[3] Stockholm Univ, Stockholm, Sweden
基金
美国国家科学基金会;
关键词
Associative learning; Pavlovian conditioning; Instrumental conditioning; Mathematical model; Conditioned reinforcement; Outcome revaluation; UNCONDITIONED STIMULUS; EXTINCTION; BEHAVIOR; REINFORCEMENT; MODEL; AUTOMAINTENANCE; OPERANT; WATER; ORGANIZATION; CONTINGENCY;
D O I
10.3758/s13423-020-01749-0
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
We present a new mathematical formulation of associative learning focused on non-human animals, which we call A-learning. Building on current animal learning theory and machine learning, A-learning is composed of two learning equations, one for stimulus-response values and one for stimulus values (conditioned reinforcement). A third equation implements decision-making by mapping stimulus-response values to response probabilities. We show that A-learning can reproduce the main features of: instrumental acquisition, including the effects of signaled and unsignaled non-contingent reinforcement; Pavlovian acquisition, including higher-order conditioning, omission training, autoshaping, and differences in form between conditioned and unconditioned responses; acquisition of avoidance responses; acquisition and extinction of instrumental chains and Pavlovian higher-order conditioning; Pavlovian-to-instrumental transfer; Pavlovian and instrumental outcome revaluation effects, including insight into why these effects vary greatly with training procedures and with the proximity of a response to the reinforcer. We discuss the differences between current theory and A-learning, such as its lack of stimulus-stimulus and response-stimulus associations, and compare A-learning with other temporal-difference models from machine learning, such as Q-learning, SARSA, and the actor-critic model. We conclude that A-learning may offer a more convenient view of associative learning than current mathematical models, and point out areas that need further development.
引用
收藏
页码:1166 / 1194
页数:29
相关论文
共 50 条
  • [21] Positive reinforcement is just the beginning: Associative learning principles for energy efficiency and climate sustainability
    Schneider, Susan M.
    Sanguinetti, Angela
    ENERGY RESEARCH & SOCIAL SCIENCE, 2021, 74
  • [22] Neuronal ensemble dynamics in associative learning
    Takehara-Nishiuchi, Kaori
    CURRENT OPINION IN NEUROBIOLOGY, 2022, 73
  • [23] Sense of agency, associative learning, and schizotypy
    Moore, James W.
    Dickinson, Anthony
    Fletcher, Paul C.
    CONSCIOUSNESS AND COGNITION, 2011, 20 (03) : 792 - 800
  • [24] Associative foundation of causal learning in rats
    Polack, Cody W.
    McConnell, Bridget L.
    Miller, Ralph R.
    LEARNING & BEHAVIOR, 2013, 41 (01) : 25 - 41
  • [25] Context extinction and associative learning in Lymnaea
    McComb, C
    Sangha, S
    Qadry, S
    Yue, J
    Scheibenstock, A
    Lukowiak, K
    NEUROBIOLOGY OF LEARNING AND MEMORY, 2002, 78 (01) : 23 - 34
  • [26] Associative Relationships in Human Predictive Learning
    Gamez, A. Matias
    Martos, Rafael
    Abad, Maria J. F.
    Rosas, Juan M.
    SPANISH JOURNAL OF PSYCHOLOGY, 2013, 16
  • [27] Understanding Associative Learning Through Higher-Order Conditioning
    Gostolupce, Dilara
    Lay, Belinda P. P.
    Maes, Etienne J. P.
    Iordanova, Mihaela D.
    FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2022, 16
  • [28] Time and Associative Learning
    Balsam, Peter D.
    Drew, Michael R.
    Gallistel, C. R.
    COMPARATIVE COGNITION & BEHAVIOR REVIEWS, 2010, 5 : 1 - 22
  • [29] The Role of Associative Learning in Ant Learning and Memory
    Fenli, Ezgi
    Mert, Ozgur
    Aksoy, Volkan
    JOURNAL OF INSECT BEHAVIOR, 2025, 38 (01)
  • [30] Learning in virtual environments: Some discrepancies between laboratory- and Internet-based research on associative learning
    Vadillo, Miguel A.
    Matute, Helena
    COMPUTERS IN HUMAN BEHAVIOR, 2009, 25 (02) : 402 - 406