Learning task-specific sensing, control and memory policies

被引:0
|
作者
Rajendran, S [1 ]
Huber, M [1 ]
机构
[1] Univ Texas, Dept Comp & Engn, Arlington, TX 76019 USA
关键词
focus of attention; event memory; reinforcement learning;
D O I
10.1142/S0218213005002119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
AI agents and robots that can adapt and handle multiple tasks in real time promise to be a powerful tool. To address the control challenges involved in such systems, the underlying control approach has to take into account the important sensory information. Modern sensors, however, can generate huge amounts of data, rendering the processing and representation of all sensor data in real time computationally intractable. This issue can be addressed by developing task-specific focus of attention strategies that limit the sensory data that is processed at any point in time to the data relevant for the given task. Alone, however, this mechanism is not adequate for solving complex tasks since the robot also has to maintain selected pieces of past information. This necessitates AI agents and robots to have the capability to remember significant past events that are required for task completion. This paper presents an approach that considers focus of attention as a problem of selecting controller and feature pairs to be processed at any given point in time to optimize system performance. This approach is further extended by incorporating short term memory and a learned memory management policy. The result is a system that learns control, sensing, and memory policies that are task-specific and adaptable to real world situations using feedback from the world in a reinforcement learning framework. The approach is illustrated using table cleaning, sorting, star-king, and copying tasks in the blocks world domain.
引用
收藏
页码:303 / 327
页数:25
相关论文
共 50 条
  • [41] Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability
    Kara, Ali Devran
    Yuksel, Serdar
    MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (04) : 2066 - 2093
  • [42] Collision-Avoiding Flocking With Multiple Fixed-Wing UAVs in Obstacle-Cluttered Environments: A Task-Specific Curriculum-Based MADRL Approach
    Yan, Chao
    Wang, Chang
    Xiang, Xiaojia
    Low, Kin Huat
    Wang, Xiangke
    Xu, Xin
    Shen, Lincheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10894 - 10908
  • [43] Learning Cost-Efficient Control Policies with XCSF: Generalization Capabilities and Further Improvement
    Marin, Didier
    Decock, Jeremie
    Rigoux, Lionel
    Sigaud, Olivier
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1235 - 1242
  • [44] How to Ask for Donations? Learning User-Specific Persuasive Dialogue Policies through Online Interactions
    Tran, Nhat
    Alikhani, Malihe
    Litman, Diane
    PROCEEDINGS OF THE 30TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2022, 2022, : 12 - 22
  • [45] Online learning of task-driven object-based visual attention control
    Borji, Ali
    Ahmadabadi, Majid Nil
    Araabi, Babak Nadjar
    Hamidi, Mandana
    IMAGE AND VISION COMPUTING, 2010, 28 (07) : 1130 - 1145
  • [46] Shared Impedance Control Based on Reinforcement Learning in a Human-Robot Collaboration Task
    Wu, Min
    He, Yanhao
    Liu, Steven
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 95 - 103
  • [47] A Decision Control Method for Autonomous Driving Based on Multi-Task Reinforcement Learning
    Cai, Yingfeng
    Yang, Shaoqing
    Wang, Hai
    Teng, Chenglong
    Chen, Long
    IEEE ACCESS, 2021, 9 (09): : 154553 - 154562
  • [48] Multi-agent reinforcement learning for redundant robot control in task-space
    Adolfo Perrusquía
    Wen Yu
    Xiaoou Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 231 - 241
  • [49] Effect of Attentional Focus on Muscles Activations and Their Recruitment During Learning a Balance Control Task
    Taleshi, Naser
    Taleshi, Mansour
    Dehghani, Sedigheh
    Bahrami, Fariba
    Jamshidi, Ali Ashraf
    2016 23RD IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING AND 2016 1ST INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2016, : 307 - 310
  • [50] Cognitive Control Over Learning: Creating, Clustering, and Generalizing Task-Set Structure
    Collins, Anne G. E.
    Frank, Michael J.
    PSYCHOLOGICAL REVIEW, 2013, 120 (01) : 190 - 229