Robot Learning by Collaborative Network Training: A Self-Supervised Method using Ranking

被引:0
|
作者
Bretan, Mason [1 ]
Oore, Sageev [2 ,3 ]
Sanan, Siddharth [1 ]
Heck, Larry [1 ]
机构
[1] Samsung Res Amer, Mountain View, CA 94043 USA
[2] Dalhousie Univ, Halifax, NS, Canada
[3] Vector Inst, Halifax, NS, Canada
关键词
Robot learning; collaborative network training; controls; NEURAL-NETWORK;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We introduce Collaborative Network Training - a self-supervised method for training neural networks with aims of: 1) enabling task objective functions that are not directly differentiable w.r.t. the network output; 2) generating continuous-space actions; 3) more direct optimization for achieving a desired task; 4) learning parameters when a process for measuring performance is available, but labeled data is unavailable. The procedure involves three randomly initialized independent networks that use ranking to train one another on a single task. The method incorporates qualities from ensemble and reinforcement learning as well as gradient free optimization methods such as Nelder-Mead. We evaluate the method against various baselines using a variety of robotics-related tasks including inverse kinematics, controls, and planning in both simulated and real-world environments.
引用
收藏
页码:1333 / 1340
页数:8
相关论文
共 50 条
  • [31] Vicsgaze: a gaze estimation method using self-supervised contrastive learning
    Gu, De
    Lv, Minghao
    Liu, Jianchu
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [32] Embodied Self-Supervised Learning (EMSSL) with Sampling and Training Coordination for Robot Arm Inverse Kinematic Model Learning
    Qu Weiming
    Liu Tianlin
    Wu Xihong
    Luo Dingsheng
    2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL, 2023, : 100 - 106
  • [33] A Self-Supervised Deep Learning Method for Seismic Data Deblending Using a Blind-Trace Network
    Wang, Shirui
    Hu, Wenyi
    Yuan, Pengyu
    Wu, Xuqing
    Zhang, Qunshan
    Nadukandi, Prashanth
    Botero, German Ocampo
    Chen, Jiefu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3405 - 3414
  • [34] SELF-SUPERVISED ADVERSARIAL TRAINING
    Chen, Kejiang
    Chen, Yuefeng
    Zhou, Hang
    Mao, Xiaofeng
    Li, Yuhong
    He, Yuan
    Xue, Hui
    Zhang, Weiming
    Yu, Nenghai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2218 - 2222
  • [35] Respiratory sound classification using supervised and self-supervised learning
    Lee, Sunju
    Ha, Taeyoung
    Hyon, YunKyong
    Chung, Chaeuk
    Kim, Yoonjoo
    Woo, Seong-Dae
    Lee, Song-I
    RESPIROLOGY, 2023, 28 : 160 - 161
  • [36] Diffraction denoising using self-supervised learning
    Markovic, Magdalena
    Malehmir, Reza
    Malehmir, Alireza
    GEOPHYSICAL PROSPECTING, 2023, 71 (07) : 1215 - 1225
  • [37] Self-supervised pose estimation method for a mobile robot in greenhouse
    Zhou Y.
    Xu T.
    Deng H.
    Miao T.
    Wu Q.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2021, 37 (09): : 263 - 274
  • [38] Self-Supervised Self-Organizing Clustering Network: A Novel Unsupervised Representation Learning Method
    Li, Shuo
    Liu, Fang
    Jiao, Licheng
    Chen, Puhua
    Li, Lingling
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1857 - 1871
  • [39] TacoPrompt: A Collaborative Multi-Task Prompt Learning Method for Self-Supervised Taxonomy Completion
    Xu, Hongyuan
    Liu, Ciyi
    Niu, Yuhang
    Chen, Yunong
    Cai, Xiangrui
    Wen, Yanlong
    Yuan, Xiaojie
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 15804 - 15817
  • [40] SELF-SUPERVISED LEARNING OF DEPTH AND POSE USING CYCLE GENERATIVE ADVERSARIAL NETWORK
    Tong, Yunhe
    Wang, Anjie
    Tan, Songchao
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 738 - 742