An Affordance Keypoint Detection Network for Robot Manipulation

被引:19
|
作者
Xu, Ruinian [1 ]
Chu, Fu-Jen [1 ]
Tang, Chao [1 ]
Liu, Weiyu [1 ]
Vela, Patricio A. [1 ]
机构
[1] Georgia Inst Technol, Inst Robot & Intelligent Machines, Atlanta, GA 30318 USA
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期
基金
美国国家科学基金会;
关键词
Deep learning in grasping and manipulation; perception for grasping and manipulation; RGB-D perception;
D O I
10.1109/LRA.2021.3062560
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This letter investigates the addition of keypoint detections to a deep network affordance segmentation pipeline. The intent is to better interpret the functionality of object parts from a manipulation perspective. While affordance segmentation does provide label information about the potential use of object parts, it lacks predictions on the physical geometry that would support such use. The keypoints remedy the situation by providing structured predictions regarding position, direction, and extent. To support joint training of affordances and keypoints, a new dataset is created based on the UMD dataset. Called the UMD+GT affordance dataset, it emphasizes household objects and affordances. The dataset has a uniform representation for five keypoints that encodes information about where and how to manipulate the associated affordance. Visual processing benchmarking shows that the trained network, called AffKp, achieves the state-of-the-art performance on affordance segmentation and satisfactory result on keypoint detection. Manipulation experiments show more stable detection of the operating position for AffKp versus segmentation-only methods and the ability to infer object part pose and operating direction for task execution.
引用
收藏
页码:2870 / 2877
页数:8
相关论文
共 50 条
  • [21] Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation
    Ju, Yuanchen
    Hu, Kaizhe
    Zhang, Guowei
    Zhang, Gu
    Jiang, Mingrun
    Xu, Huazhe
    COMPUTER VISION - ECCV 2024, PT XLI, 2025, 15099 : 222 - 239
  • [22] Task-Oriented Robot Cognitive Manipulation Planning Using Affordance Segmentation and Logic Reasoning
    Wang, Zhongli
    Tian, Guohui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12172 - 12185
  • [23] Generalized Affordance Templates for Mobile Manipulation
    Hart, Stephen
    Quispe, Ana Huaman
    Lanighan, Michael W.
    Gee, Seth
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6240 - 6246
  • [24] KGDet: keypoint-guided trauma hemorrhage detection network
    Wu, Xue
    Hu, Min
    Zhang, Yaorong
    Luo, Jianguo
    Wang, Yanni
    Han, Shipeng
    JOURNAL OF ELECTRONIC IMAGING, 2025, 34 (01)
  • [25] An Efficient Method of Face and Keypoint Detection Based on Shared Network
    Tian, Xiaogang
    Fan, Xiaoye
    Tang, Feixue
    Cao, Xixin
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (ACAAI 2018), 2018, 155 : 90 - 93
  • [26] A Keypoint-based Global Association Network for Lane Detection
    Wang, Jinsheng
    Ma, Yinchao
    Huang, Shaofei
    Hui, Tianrui
    Wang, Fei
    Qian, Chen
    Zhang, Tianzhu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1382 - 1391
  • [27] Manipulation Planning for Object Re-Orientation Based on Semantic Segmentation Keypoint Detection
    Wong, Ching-Chang
    Yeh, Li-Yu
    Liu, Chih-Cheng
    Tsai, Chi-Yi
    Aoyama, Hisasuki
    SENSORS, 2021, 21 (07)
  • [28] YOLOv8-PoseBoost: Advancements in Multimodal Robot Pose Keypoint Detection
    Wang, Feng
    Wang, Gang
    Lu, Baoli
    ELECTRONICS, 2024, 13 (06)
  • [29] Multimodal Detection and Classification of Robot Manipulation Failures
    Inceoglu, Arda
    Aksoy, Eren Erdal
    Sariel, Sanem
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1396 - 1403
  • [30] A New Semantic Edge Aware Network for Object Affordance Detection
    Yin, Congcong
    Zhang, Qiuju
    Ren, Wenqiang
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (01)