Visual Affordance Prediction for Guiding Robot Exploration

被引:2
作者
Bharadhwaj, Homanga [1 ]
Gupta, Abhinav [1 ]
Tulsiani, Shubham [1 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
来源
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年
关键词
D O I
10.1109/ICRA48891.2023.10161288
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Motivated by the intuitive understanding humans have about the space of possible interactions, and the ease with which they can generalize this understanding to previously unseen scenes, we develop an approach for learning 'visual affordances'. Given an input image of a scene, we infer a distribution over plausible future states that can be achieved via interactions with it. To allow predicting diverse plausible futures, we discretize the space of continuous images with a VQ-VAE and use a Transformer-based model to learn a conditional distribution in the latent embedding space. We show that these models can be trained using large-scale and diverse passive data, and that the learned models exhibit compositional generalization to diverse objects beyond the training distribution. We evaluate the quality and diversity of the generations, and demonstrate how the trained affordance model can be used for guiding exploration during visual goal-conditioned policy learning in robotic manipulation.
引用
收藏
页码:3029 / 3036
页数:8
相关论文
共 50 条
  • [21] Learned Map Prediction for Enhanced Mobile Robot Exploration
    Shrestha, Rakesh
    Tian, Fei-Peng
    Feng, Wei
    Tan, Ping
    Vaughan, Richard
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 1197 - 1204
  • [22] Pedestrian Density Prediction for Efficient Mobile Robot Exploration
    Zapf, Marc Patrick
    Kawanabe, Motoaki
    Saiki, Luis Yoichi Morales
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4615 - 4622
  • [23] Intention Prediction is Affordance Perception
    Alhasan, Ayeh
    Kallen, Rachel W.
    Richardson, Michael J.
    ECOLOGICAL PSYCHOLOGY, 2025, 37 (01) : 21 - 35
  • [24] The affordance structure matrix - A concept exploration and attention directing tool for affordance based design
    Maier, Jonathan R. A.
    Ezhilan, Thulasiram
    Fadel, Georges M.
    19TH INTERNATIONAL CONFERENCE ON DESIGN THEORY AND METHODOLOGY/1ST INTERNATIONAL CONFERENCE ON MICRO AND NANO SYSTEMS, VOL 3, PART A AND B, 2008, : 277 - 287
  • [25] Learning to Detect Visual Grasp Affordance
    Song, Hyun Oh
    Fritz, Mario
    Goehring, Daniel
    Darrell, Trevor
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (02) : 798 - 809
  • [26] Modulation of visual attention by object affordance
    Garrido-Vasquez, Patricia
    Schuboe, Anna
    FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [27] Visual Affordance and Function Understanding: A Survey
    Hassanin, Mohammed
    Khan, Salman
    Tahtali, Murat
    ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [28] Visual Exploration and Analysis of Human-Robot Interaction Rules
    Zhang, Hui
    Boyles, Michael J.
    VISUALIZATION AND DATA ANALYSIS 2013, 2013, 8654
  • [29] Effects of broken affordance on visual extinction
    Wulff, Melanie
    Humphreys, Glyn W.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2015, 9
  • [30] Visual learning of affordance based cues
    Fritz, Gerald
    Paletta, Lucas
    Kumar, Manish
    Dorffner, Georg
    Breithaupt, Ralph
    Rome, Erich
    FROM ANIMALS TO ANIMATS 9, PROCEEDINGS, 2006, 4095 : 52 - 64