An Affordance Keypoint Detection Network for Robot Manipulation

被引：19

作者：

Xu, Ruinian ^{[1
]}

Chu, Fu-Jen ^{[1
]}

Tang, Chao ^{[1
]}

Liu, Weiyu ^{[1
]}

Vela, Patricio A. ^{[1
]}

机构：

[1] Georgia Inst Technol, Inst Robot & Intelligent Machines, Atlanta, GA 30318 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期

基金：

美国国家科学基金会;

关键词：

Deep learning in grasping and manipulation; perception for grasping and manipulation; RGB-D perception;

D O I：

10.1109/LRA.2021.3062560

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This letter investigates the addition of keypoint detections to a deep network affordance segmentation pipeline. The intent is to better interpret the functionality of object parts from a manipulation perspective. While affordance segmentation does provide label information about the potential use of object parts, it lacks predictions on the physical geometry that would support such use. The keypoints remedy the situation by providing structured predictions regarding position, direction, and extent. To support joint training of affordances and keypoints, a new dataset is created based on the UMD dataset. Called the UMD+GT affordance dataset, it emphasizes household objects and affordances. The dataset has a uniform representation for five keypoints that encodes information about where and how to manipulate the associated affordance. Visual processing benchmarking shows that the trained network, called AffKp, achieves the state-of-the-art performance on affordance segmentation and satisfactory result on keypoint detection. Manipulation experiments show more stable detection of the operating position for AffKp versus segmentation-only methods and the ability to infer object part pose and operating direction for task execution.

引用

页码：2870 / 2877

页数：8

共 50 条

[21] Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation
Ju, Yuanchen
Hu, Kaizhe
Zhang, Guowei
Zhang, Gu
Jiang, Mingrun
Xu, Huazhe
COMPUTER VISION - ECCV 2024, PT XLI, 2025, 15099 : 222 - 239
[22] Task-Oriented Robot Cognitive Manipulation Planning Using Affordance Segmentation and Logic Reasoning
Wang, Zhongli
Tian, Guohui
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12172 - 12185
[23] Generalized Affordance Templates for Mobile Manipulation
Hart, Stephen
Quispe, Ana Huaman
Lanighan, Michael W.
Gee, Seth
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6240 - 6246
[24] KGDet: keypoint-guided trauma hemorrhage detection network
Wu, Xue
Hu, Min
Zhang, Yaorong
Luo, Jianguo
Wang, Yanni
Han, Shipeng
JOURNAL OF ELECTRONIC IMAGING, 2025, 34 (01)
[25] An Efficient Method of Face and Keypoint Detection Based on Shared Network
Tian, Xiaogang
Fan, Xiaoye
Tang, Feixue
Cao, Xixin
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (ACAAI 2018), 2018, 155 : 90 - 93
[26] A Keypoint-based Global Association Network for Lane Detection
Wang, Jinsheng
Ma, Yinchao
Huang, Shaofei
Hui, Tianrui
Wang, Fei
Qian, Chen
Zhang, Tianzhu
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1382 - 1391
[27] Manipulation Planning for Object Re-Orientation Based on Semantic Segmentation Keypoint Detection
Wong, Ching-Chang
Yeh, Li-Yu
Liu, Chih-Cheng
Tsai, Chi-Yi
Aoyama, Hisasuki
SENSORS, 2021, 21 (07)
[28] YOLOv8-PoseBoost: Advancements in Multimodal Robot Pose Keypoint Detection
Wang, Feng
Wang, Gang
Lu, Baoli
ELECTRONICS, 2024, 13 (06)
[29] Multimodal Detection and Classification of Robot Manipulation Failures
Inceoglu, Arda
Aksoy, Eren Erdal
Sariel, Sanem
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1396 - 1403
[30] A New Semantic Edge Aware Network for Object Affordance Detection
Yin, Congcong
Zhang, Qiuju
Ren, Wenqiang
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (01)

← 1 2 3 4 5 →