Active Vision for Robot Manipulators Using the Free Energy Principle

被引:16
作者
Van de Maele, Toon [1 ]
Verbelen, Tim [1 ]
Catal, Ozan [1 ]
De Boom, Cedric [1 ]
Dhoedt, Bart [1 ]
机构
[1] Univ Ghent, IMEC, Dept Informat Technol, IDLab, Ghent, Belgium
来源
FRONTIERS IN NEUROROBOTICS | 2021年 / 15卷
关键词
active vision; active inference; deep learning; generative modeling; robotics; INFERENCE; RECONSTRUCTION; CONSTRUCTION;
D O I
10.3389/fnbot.2021.642780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Occlusions, restricted field of view and limited resolution all constrain a robot's ability to sense its environment from a single observation. In these cases, the robot first needs to actively query multiple observations and accumulate information before it can complete a task. In this paper, we cast this problem of active vision as active inference, which states that an intelligent agent maintains a generative model of its environment and acts in order to minimize its surprise, or expected free energy according to this model. We apply this to an object-reaching task for a 7-DOF robotic manipulator with an in-hand camera to scan the workspace. A novel generative model using deep neural networks is proposed that is able to fuse multiple views into an abstract representation and is trained from data by minimizing variational free energy. We validate our approach experimentally for a reaching task in simulation in which a robotic agent starts without any knowledge about its workspace. Each step, the next view pose is chosen by evaluating the expected free energy. We find that by minimizing the expected free energy, exploratory behavior emerges when the target object to reach is not in view, and the end effector is moved to the correct reach position once the target is located. Similar to an owl scavenging for prey, the robot naturally prefers higher ground for exploring, approaching its target once located.
引用
收藏
页数:18
相关论文
共 64 条
  • [21] Fraundorfer F, 2012, IEEE INT C INT ROBOT, P4557, DOI 10.1109/IROS.2012.6385934
  • [22] Active inference and learning
    Friston, Karl
    FitzGerald, Thomas
    Rigoli, Francesco
    Schwartenbeck, Philipp
    O'Doherty, John
    Pezzulo, Giovanni
    [J]. NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2016, 68 : 862 - 879
  • [23] Life as we know it
    Friston, Karl
    [J]. JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2013, 10 (86)
  • [24] The free-energy principle: a unified brain theory?
    Friston, Karl J.
    [J]. NATURE REVIEWS NEUROSCIENCE, 2010, 11 (02) : 127 - 138
  • [25] Gregor K, 2015, PR MACH LEARN RES, V37, P1462
  • [26] Hadsell R., 2006, IEEE C COMP VIS PATT, P1735, DOI DOI 10.1109/CVPR.2006.100
  • [27] Deep Active Inference and Scene Construction
    Heins, R. Conor
    Mirza, M. Berk
    Parr, Thomas
    Friston, Karl
    Kagan, Igor
    Pooresmaeili, Arezoo
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2020, 3
  • [28] Heljakka Ari, 2018, arXiv
  • [29] Heljakka A, 2020, IEEE WINT CONF APPL, P3109, DOI [10.1109/wacv45572.2020.9093375, 10.1109/WACV45572.2020.9093375]
  • [30] Huang HB, 2018, ADV NEUR IN, V31