A-learning: A new formulation of associative learning theory

被引:8
|
作者
Ghirlanda, Stefano [1 ,2 ,3 ]
Lind, Johan [3 ]
Enquist, Magnus [3 ]
机构
[1] CUNY, Brooklyn Coll, New York, NY 10021 USA
[2] CUNY, Grad Ctr, New York, NY 10021 USA
[3] Stockholm Univ, Stockholm, Sweden
基金
美国国家科学基金会;
关键词
Associative learning; Pavlovian conditioning; Instrumental conditioning; Mathematical model; Conditioned reinforcement; Outcome revaluation; UNCONDITIONED STIMULUS; EXTINCTION; BEHAVIOR; REINFORCEMENT; MODEL; AUTOMAINTENANCE; OPERANT; WATER; ORGANIZATION; CONTINGENCY;
D O I
10.3758/s13423-020-01749-0
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
We present a new mathematical formulation of associative learning focused on non-human animals, which we call A-learning. Building on current animal learning theory and machine learning, A-learning is composed of two learning equations, one for stimulus-response values and one for stimulus values (conditioned reinforcement). A third equation implements decision-making by mapping stimulus-response values to response probabilities. We show that A-learning can reproduce the main features of: instrumental acquisition, including the effects of signaled and unsignaled non-contingent reinforcement; Pavlovian acquisition, including higher-order conditioning, omission training, autoshaping, and differences in form between conditioned and unconditioned responses; acquisition of avoidance responses; acquisition and extinction of instrumental chains and Pavlovian higher-order conditioning; Pavlovian-to-instrumental transfer; Pavlovian and instrumental outcome revaluation effects, including insight into why these effects vary greatly with training procedures and with the proximity of a response to the reinforcer. We discuss the differences between current theory and A-learning, such as its lack of stimulus-stimulus and response-stimulus associations, and compare A-learning with other temporal-difference models from machine learning, such as Q-learning, SARSA, and the actor-critic model. We conclude that A-learning may offer a more convenient view of associative learning than current mathematical models, and point out areas that need further development.
引用
收藏
页码:1166 / 1194
页数:29
相关论文
共 50 条
  • [31] Memristive neural network circuit implementation of associative learning with overshadowing and blocking
    Liu, Jinying
    Zhou, Yue
    Duan, Shukai
    Hu, Xiaofang
    COGNITIVE NEURODYNAMICS, 2023, 17 (04) : 1029 - 1043
  • [32] Performance factors in associative learning: Assessment of the sometimes competing retrieval model
    Witnauer, James E.
    Wojick, Brittany M.
    Polack, Cody W.
    Miller, Ralph R.
    LEARNING & BEHAVIOR, 2012, 40 (03) : 347 - 366
  • [33] Appetitive Associative Olfactory Learning in Drosophila Larvae
    Apostolopoulou, Anthi A.
    Widmann, Annekathrin
    Rohwedder, Astrid
    Pfitzenmaier, Johanna E.
    Thum, Andreas S.
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2013, (72):
  • [34] Associative learning in the box jellyfish Tripedalia cystophora
    Bielecki, Jan
    Nielsen, Sofie Katrine Dam
    Nachman, Gosta
    Garm, Anders
    CURRENT BIOLOGY, 2023, 33 (19) : 4150 - +
  • [35] Synthetic associative learning in engineered multicellular consortia
    Macia, Javier
    Vidiella, Blai
    Sole, Ricard V.
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2017, 14 (129)
  • [36] The power of associative learning and the ontogeny of optimal behaviour
    Enquist, Magnus
    Lind, Johan
    Ghirlanda, Stefano
    ROYAL SOCIETY OPEN SCIENCE, 2016, 3 (11):
  • [37] Associative Learning of Social Value in Dynamic Groups
    FeldmanHall, Oriel
    Dunsmoor, Joseph E.
    Kroes, Marijn C. W.
    Lackovic, Sandra
    Phelps, Elizabeth A.
    PSYCHOLOGICAL SCIENCE, 2017, 28 (08) : 1160 - 1170
  • [38] Associative (not Hebbian) learning and the mirror neuron system
    Cooper, Richard P.
    Cook, Richard
    Dickinson, Anthony
    Heyes, Cecilia M.
    NEUROSCIENCE LETTERS, 2013, 540 : 28 - 36
  • [39] Simulation of associative learning with the replaced elements model
    Steven Glautier
    Behavior Research Methods, 2007, 39 : 993 - 1000
  • [40] Reaction time as a measure of human associative learning
    Craddock, Paul
    Molet, Mikael
    Miller, Ralph R.
    BEHAVIOURAL PROCESSES, 2012, 90 (02) : 189 - 197