Learning task-specific sensing, control and memory policies

被引：0

作者：

Rajendran, S ^{[1
]}

Huber, M ^{[1
]}

机构：

[1] Univ Texas, Dept Comp & Engn, Arlington, TX 76019 USA

来源：

INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS | 2005年 / 14卷 / 1-2期

关键词：

focus of attention; event memory; reinforcement learning;

D O I：

10.1142/S0218213005002119

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

AI agents and robots that can adapt and handle multiple tasks in real time promise to be a powerful tool. To address the control challenges involved in such systems, the underlying control approach has to take into account the important sensory information. Modern sensors, however, can generate huge amounts of data, rendering the processing and representation of all sensor data in real time computationally intractable. This issue can be addressed by developing task-specific focus of attention strategies that limit the sensory data that is processed at any point in time to the data relevant for the given task. Alone, however, this mechanism is not adequate for solving complex tasks since the robot also has to maintain selected pieces of past information. This necessitates AI agents and robots to have the capability to remember significant past events that are required for task completion. This paper presents an approach that considers focus of attention as a problem of selecting controller and feature pairs to be processed at any given point in time to optimize system performance. This approach is further extended by incorporating short term memory and a learned memory management policy. The result is a system that learns control, sensing, and memory policies that are task-specific and adaptable to real world situations using feedback from the world in a reinforcement learning framework. The approach is illustrated using table cleaning, sorting, star-king, and copying tasks in the blocks world domain.

引用

页码：303 / 327

页数：25

共 50 条

[41] Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability
Kara, Ali Devran
Yuksel, Serdar
MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (04) : 2066 - 2093
[42] Collision-Avoiding Flocking With Multiple Fixed-Wing UAVs in Obstacle-Cluttered Environments: A Task-Specific Curriculum-Based MADRL Approach
Yan, Chao
Wang, Chang
Xiang, Xiaojia
Low, Kin Huat
Wang, Xiangke
Xu, Xin
Shen, Lincheng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10894 - 10908
[43] Learning Cost-Efficient Control Policies with XCSF: Generalization Capabilities and Further Improvement
Marin, Didier
Decock, Jeremie
Rigoux, Lionel
Sigaud, Olivier
GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1235 - 1242
[44] How to Ask for Donations? Learning User-Specific Persuasive Dialogue Policies through Online Interactions
Tran, Nhat
Alikhani, Malihe
Litman, Diane
PROCEEDINGS OF THE 30TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2022, 2022, : 12 - 22
[45] Online learning of task-driven object-based visual attention control
Borji, Ali
Ahmadabadi, Majid Nil
Araabi, Babak Nadjar
Hamidi, Mandana
IMAGE AND VISION COMPUTING, 2010, 28 (07) : 1130 - 1145
[46] Shared Impedance Control Based on Reinforcement Learning in a Human-Robot Collaboration Task
Wu, Min
He, Yanhao
Liu, Steven
ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 95 - 103
[47] A Decision Control Method for Autonomous Driving Based on Multi-Task Reinforcement Learning
Cai, Yingfeng
Yang, Shaoqing
Wang, Hai
Teng, Chenglong
Chen, Long
IEEE ACCESS, 2021, 9 (09): : 154553 - 154562
[48] Multi-agent reinforcement learning for redundant robot control in task-space
Adolfo Perrusquía
Wen Yu
Xiaoou Li
International Journal of Machine Learning and Cybernetics, 2021, 12 : 231 - 241
[49] Effect of Attentional Focus on Muscles Activations and Their Recruitment During Learning a Balance Control Task
Taleshi, Naser
Taleshi, Mansour
Dehghani, Sedigheh
Bahrami, Fariba
Jamshidi, Ali Ashraf
2016 23RD IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING AND 2016 1ST INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2016, : 307 - 310
[50] Cognitive Control Over Learning: Creating, Clustering, and Generalizing Task-Set Structure
Collins, Anne G. E.
Frank, Michael J.
PSYCHOLOGICAL REVIEW, 2013, 120 (01) : 190 - 229

← 1 2 3 4 5 →