A NEURAL NETWORK-LIKE CRITIC FOR REINFORCEMENT LEARNING

被引:1
|
作者
YAMAKAWA, H [1 ]
OKABE, Y [1 ]
机构
[1] UNIV TOKYO,TOKYO,JAPAN
关键词
REACTIVE SYSTEM; NEURAL NETWORK; AGENT; MAZE-LIKE ENVIRONMENT; RECURSIVE STRUCTURE; AMYGDALA;
D O I
10.1016/0893-6080(94)00086-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An adaptive agent that contains a reactive network and a critic that supervises that reactive network have been studied. Agent actions are generated in response to stimuli through the reactive network and they influence the ambient environment The critic has a new learning algorithm that recursively enhances reinforcement signals from fixed reinforcement signals by interacting with the environment. The reactive network learns appropriate stimulus-action relations by reinforcement learning. Computer simulation demonstrates that this neural critic is effective in environments where the concepts are embedded in a maze structure. We also suggest similarities between this critic model and the neural circuit in the human brain.
引用
收藏
页码:363 / 373
页数:11
相关论文
共 50 条
  • [41] Reinforcement learning of recurrent neural network for temporal coding
    Kimura, Daichi
    Hayakawa, Yoshinori
    NEUROCOMPUTING, 2008, 71 (16-18) : 3379 - 3386
  • [42] A Reinforcement Learning Neural Network for Robotic Manipulator Control
    Hu, Yazhou
    Si, Bailu
    NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
  • [43] Research on a reinforcement learning algorithm based on neural network
    Lu, Xin
    Gao, Yang
    Li, Ning
    Chen, Shi-Fu
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2002, 39 (08):
  • [44] Using a time-delay actor-critic neural architecture with dopamine-like reinforcement signal for learning in autonomous robots
    Pérez-Uribe, A
    EMERGENT NEURAL COMPUTATIONAL ARCHITECTURES BASED ON NEUROSCIENCE: TOWARDS NEUROSCIENCE-INSPIRED COMPUTING, 2001, 2036 : 522 - 533
  • [45] Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network
    Schott, Lucas
    Hajri, Hatem
    Lamprier, Sylvain
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [46] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
    Xu, Tao
    Gong, Lina
    Zhang, Wei
    Li, Xuhong
    Wang, Xia
    Pan, Wenwen
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [47] A Motor Learning Neural Model based on Bayesian Network and Reinforcement Learning
    Hosoya, Haruo
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 760 - 767
  • [48] A World Model for Actor–Critic in Reinforcement Learning
    A. I. Panov
    L. A. Ugadiarov
    Pattern Recognition and Image Analysis, 2023, 33 : 467 - 477
  • [49] Totally model-free actor-critic recurrent neural-network reinforcement learning in non-Markovian domains
    Mizutani, Eiji
    Dreyfus, Stuart
    ANNALS OF OPERATIONS RESEARCH, 2017, 258 (01) : 107 - 131
  • [50] Totally model-free actor-critic recurrent neural-network reinforcement learning in non-Markovian domains
    Eiji Mizutani
    Stuart Dreyfus
    Annals of Operations Research, 2017, 258 : 107 - 131