Self-Supervised Cross-Modal Online Learning of Basic Object Affordances for Developmental Robotic Systems

被引：31

作者：

Ridge, Barry ^{[1
]}

Skocaj, Danijel ^{[1
]}

Leonardis, Ales ^{[1
]}

机构：

[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana 61000, Slovenia

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2010年

关键词：

ENERGY MINIMIZATION; MODEL;

D O I：

10.1109/ROBOT.2010.5509544

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

For a developmental robotic system to function successfully in the real world, it is important that it be able to form its own internal representations of affordance classes based on observable regularities in sensory data. Usually successful classifiers are built using labeled training data, but it is not always realistic to assume that labels are available in a developmental robotics setting. There does, however, exist an advantage in this setting that can help circumvent the absence of labels: co-occurrence of correlated data across separate sensory modalities over time. The main contribution of this paper is an online classifier training algorithm based on Kohonen's learning vector quantization (LVQ) that, by taking advantage of this co-occurrence information, does not require labels during training, either dynamically generated or otherwise. We evaluate the algorithm in experiments involving a robotic arm that interacts with various household objects on a table surface where camera systems extract features for two separate visual modalities. It is shown to improve its ability to classify the affordances of novel objects over time, coming close to the performance of equivalent fully-supervised algorithms.

引用

页码：5047 / 5054

页数：8

共 50 条

[1] Self-Supervised Online Learning of Basic Object Push Affordances
Ridge, Barry
Leonardis, Ales
Ude, Ales
Denisa, Miha
Skocaj, Danijel
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2015, 12
[2] Self-Supervised Correlation Learning for Cross-Modal Retrieval
Liu, Yaxin
Wu, Jianlong
Qu, Leigang
Gan, Tian
Yin, Jianhua
Nie, Liqiang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2851 - 2863
[3] Cross-modal self-supervised representation learning for gesture and skill recognition in robotic surgery
Wu, Jie Ying
Tamhane, Aniruddha
Kazanzides, Peter
Unberath, Mathias
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (05) : 779 - 787
[4] Cross-modal self-supervised representation learning for gesture and skill recognition in robotic surgery
Jie Ying Wu
Aniruddha Tamhane
Peter Kazanzides
Mathias Unberath
International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 779 - 787
[5] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION
Khare, Aparna
Parthasarathy, Srinivas
Sundaram, Shiva
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388
[6] Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Das, Srijan
Ryoo, Michael
2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
[7] Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Alwassel, Humam
Mahajan, Dhruv
Korbar, Bruno
Torresani, Lorenzo
Ghanem, Bernard
Tran, Du
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[8] Self-supervised incomplete cross-modal hashing retrieval
Peng, Shouyong
Yao, Tao
Li, Ying
Wang, Gang
Wang, Lili
Yan, Zhiming
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
[9] Self-Supervised Visual Representations for Cross-Modal Retrieval
Patel, Yash
Gomez, Lluis
Rusinol, Marcal
Karatzas, Dimosthenis
Jawahar, C., V
ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186
[10] Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
Salvador, Amaia
Gundogdu, Erhan
Bazzani, Loris
Donoser, Michael
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15470 - 15479

← 1 2 3 4 5 →