Actor-Q based active perception learning system

被引:0
|
作者
Shibata, K [1 ]
Nishino, T [1 ]
Okabe, Y [1 ]
机构
[1] Oita Univ, Dept Elect & Elect Engn, Oita 8701192, Japan
来源
2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS | 2001年
关键词
Actor-Q architecture; reinforcement learning; neural network; active perception; visual sensor;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An active perception learning system based on reinforcement learning is proposed. A novel reinforcement architecture called Actor-Q is employed in which Q-learning and Actor-Critic are combined. The system decides its actions according to Q-values. One of the actions is to move its sensor and the others are to make all answer of its recognition result, each of which corresponds to each pattern. When the sensor motion is selected the sensor moves according to thc actor's output signals. The Q-value for the sensor motion is trained by Q-learning. and the Actor is trained hy the Q-value for the sensor motion on behalf of the critic When one of the other actions is selected the system outputs the recognition result. When the recognition answer is correct, the Q-value is trained to be the upper limit of the Q-value, and when the answer is not correct, it is trained to be 0.0. The module to compute Q-value and the actor module are both consisted of a neural network and are trained by Error Back Propagation. The training signals are generated based on the above reinforcement learning. It was confirmed by some simulations using a visual sensor with non-uniform visual cells that the system moves its sensor to the place where it can recognize the presented pattern correctly. Even though the Q-value surface as a function of the sensor location has some local peaks. the sensor was not trapped and moved to the appropriate direction because the Q-value for the sensor motion becomes larger.
引用
收藏
页码:1000 / 1005
页数:6
相关论文
共 50 条
  • [21] Development and Validation of Active Roll Control based on Actor-critic Neural Network Reinforcement Learning
    Bahr, Matthias
    Reicherts, Sebastian
    Sieberg, Philipp
    Morss, Luca
    Schramm, Dieter
    SIMULTECH: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS, 2019, 2019, : 36 - 46
  • [22] Active perception based on deep reinforcement learning for autonomous robotic damage inspection
    Tang, Wen
    Jahanshahi, Mohammad R.
    MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [23] An Active Learning-Based Medical Diagnosis System
    Pinto, Catarina
    Faria, Juliana
    Macedo, Luis
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 207 - 218
  • [24] Designing an active learning based system for corpus annotation
    Busser, Bertjan
    Morante, Roser
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 375 - 381
  • [25] Learning About Service Orientation in KIBS: Understanding the Customer as an Active Actor
    Kallio, Katri
    Lappalainen, Inka
    SERVICE SCIENCE, 2014, 6 (02) : 78 - 91
  • [26] Energy Cooperation in CoMP System Based on Q-learning
    Lv, Yabo
    Li, Baogang
    Yao, Yuanbin
    Guo, Dandan
    PROCEEDINGS OF 2017 11TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2017, : 90 - 94
  • [27] A Hybrid Web Recommender System Based on Q-Learning
    Taghipour, Nima
    Kardan, Ahmad
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1164 - 1168
  • [28] Reversely Discovering and Modifying Properties Based on Active Deep Q-Learning
    Yu, Lei
    Huo, Zhifa
    IEEE ACCESS, 2020, 8 : 157819 - 157829
  • [29] HAPTIC SENSING SYSTEM WITH ACTIVE PERCEPTION
    SAKAGUCHI, Y
    ADVANCED ROBOTICS, 1994, 8 (03) : 263 - 283
  • [30] Active perception and map learning for robot navigation
    Filliat, D
    Meyer, JA
    FROM ANIMALS TO ANIMATS 6, 2000, : 246 - 255