Actor-Q based active perception learning system

Cited by: 0

Authors:

Shibata, K [1]

Nishino, T [1]

Okabe, Y [1]

Affiliation:

[1] Oita Univ, Dept Elect & Elect Engn, Oita 8701192, Japan

Source:

2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS | 2001

Keywords:

Actor-Q architecture; reinforcement learning; neural network; active perception; visual sensor;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

An active perception learning system based on reinforcement learning is proposed. It employs a novel reinforcement learning architecture called Actor-Q, which combines Q-learning and Actor-Critic. The system selects its actions according to Q-values. One action is to move the sensor; each of the other actions is to output a recognition answer, one per candidate pattern. When the sensor motion is selected, the sensor moves according to the actor's output signals. The Q-value for the sensor motion is trained by Q-learning, and the actor is trained using the Q-value for the sensor motion in place of the critic. When one of the other actions is selected, the system outputs the corresponding recognition result. When the recognition answer is correct, its Q-value is trained toward the upper limit of the Q-value; when the answer is incorrect, it is trained toward 0.0. The module that computes the Q-values and the actor module each consist of a neural network and are trained by error backpropagation, with training signals generated by the reinforcement learning described above. Simulations using a visual sensor with non-uniform visual cells confirmed that the system moves its sensor to a location where it can recognize the presented pattern correctly. Even though the Q-value surface as a function of the sensor location has some local peaks, the sensor was not trapped and moved in the appropriate direction because the Q-value for the sensor motion becomes larger.
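The training signals described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the epsilon-greedy exploration rule, the discount factor `gamma`, the zero step reward, the upper limit `q_upper = 0.9`, and the noise-weighted actor update are all assumptions made for illustration. The abstract only states that the motion Q-value is trained by Q-learning, that answer Q-values are trained toward the upper limit or 0.0, and that the actor is trained using the motion Q-value in place of a critic.

```python
import random

MOVE = 0  # assumed layout: index 0 = "move sensor", indices 1..N = recognition answers


def select_action(q_values, epsilon, rng):
    """Pick an action from the Q-values (epsilon-greedy is an assumed exploration rule)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)


def answer_target(is_correct, q_upper=0.9):
    """Training target for a recognition action: upper limit if correct, else 0.0.

    q_upper = 0.9 is a hypothetical value; the abstract gives no number.
    """
    return q_upper if is_correct else 0.0


def motion_target(next_q_values, gamma=0.9, reward=0.0):
    """Standard Q-learning target for the sensor-motion action (gamma and reward assumed)."""
    return reward + gamma * max(next_q_values)


def actor_target(actor_output, noise, q_move, q_move_target):
    """Assumed actor update: shift the output along the exploration noise,
    scaled by the temporal-difference error of the motion Q-value."""
    td_error = q_move_target - q_move
    return [a + td_error * n for a, n in zip(actor_output, noise)]
```

In this sketch, one learning step would call `select_action`; if the result is `MOVE`, the sensor is displaced by the actor output plus noise, and `motion_target` and `actor_target` provide the supervised targets for the two networks. Otherwise `answer_target` provides the target for the chosen recognition output.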


Pages: 1000-1005

Number of pages: 6
