Daily Assistive View Control Learning of Low-Cost Low-Rigidity Robot via Large-Scale Vision-Language Model

被引:0
作者
Kawaharazuka, Kento [1 ]
Kanazawa, Naoaki [1 ]
Obinata, Yoshiki [1 ]
Okada, Kei [1 ]
Inaba, Masayuki [1 ]
机构
[1] Univ Tokyo, Sch Informat Sci & Technol, Dept Mechano Inform, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138656, Japan
来源
2023 IEEE-RAS 22ND INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, HUMANOIDS | 2023年
关键词
D O I
10.1109/HUMANOIDS57100.2023.10375239
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, we develop a simple daily assistive robot that controls its own vision according to linguistic instructions. The robot performs several daily tasks such as recording a user's face, hands, or screen, and remotely capturing images of desired locations. To construct such a robot, we combine a pretrained large-scale vision-language model with a low-cost lowrigidity robot arm. The correlation between the robot's physical and visual information is learned probabilistically using a neural network, and changes in the probability distribution based on changes in time and environment are considered by parametric bias, which is a learnable network input variable. We demonstrate the effectiveness of this learning method by open-vocabulary view control experiments with an actual robot arm, MyCobot.
引用
收藏
页数:6
相关论文
共 22 条
  • [1] Ahn M, 2022, Arxiv, DOI arXiv:2204.01691
  • [2] Bircher A, 2016, IEEE INT CONF ROBOT, P1462, DOI 10.1109/ICRA.2016.7487281
  • [3] Connolly Cl, 1985, P IEEE INT C ROB AUT, V2, P432, DOI DOI 10.1109/ROBOT.1985.1087372
  • [4] Embodied Question Answering
    Das, Abhishek
    Datta, Samyak
    Gkioxari, Georgia
    Lee, Stefan
    Parikh, Devi
    Batra, Dhruv
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1 - 10
  • [5] Hatori J, 2018, IEEE INT CONF ROBOT, P3774
  • [6] Automatic Surgical Field of View Control in Robot-Assisted Nasal Surgery
    He, Yucheng
    Zhao, Baoliang
    Qi, Xiaozhi
    Li, Shibo
    Yang, Yuanyuan
    Hu, Ying
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (01) : 247 - 254
  • [7] Huang CG, 2022, Arxiv, DOI arXiv:2210.05714
  • [8] Kawaharazuka K., 2023, P 2023 IEEE RAS INT
  • [9] VQA-based Robotic State Recognition Optimized with Genetic Algorithm
    Kawaharazuka, Kento
    Obinata, Yoshiki
    Kanazawa, Naoaki
    Okada, Kei
    Inaba, Masayuki
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8306 - 8311
  • [10] Environmentally Adaptive Control Including Variance Minimization Using Stochastic Predictive Network with Parametric Bias: Application to Mobile Robots
    Kawaharazuka, Kento
    Shinjo, Koki
    Kawamura, Yoichiro
    Okada, Kei
    Inaba, Masayuki
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8404 - 8410