A Self-Supervised Learning Solution With Momentum-Based Siamese Network for Robotic Visual Task

被引:0
|
作者
Huang, Yuehong [1 ]
Tseng, Yu-Chee [1 ,2 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 30010, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Coll Artificial Intelligence, Tainan 71150, Taiwan
关键词
Artificial intelligence; causality; cause-and-effect; momentum-based Siamese network; self-supervised;
D O I
10.1109/ACCESS.2023.3321276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How a robot induces causality, abstracts information from states, and plans for tasks are major challenges in artificial intelligence. However, habitually used deep reinforcement learning is not a cure-all solution. This study proposes MoSN, a momentum-based Siamese network for a robotic visual task. Instead of trying to learn representations that abstract all the variation factors of the environment as in most prior studies, MoSN labels its observations as their causes so that it can learn representations of visual states in a self-supervised manner. These representations are then used to enhance perceptual and cognitive abilities and to perform our robotic visual task when a robot takes over a new environment. In particular, this study proposes a momentum-based Siamese network with augmentation methods and a hybrid explicit contrastive loss to improve the combinatorial generalization capability of our model. We validate MoSN in real-world environments equipped with lights and light switches and attempt to derive the cause-and-effect relationship between switch states and visual states. MoSN is compared with previous methods in both static and dynamic environments with both fixed and dynamic viewing angles. MoSN outperforms previous methods in most cases.
引用
收藏
页码:112764 / 112775
页数:12
相关论文
共 50 条
  • [1] Siamese Network Based Multiscale Self-Supervised Heterogeneous Graph Representation Learning
    Chen, Zijun
    Luo, Lihui
    Li, Xunkai
    Jiang, Bin
    Guo, Qiang
    Wang, Chunpeng
    IEEE ACCESS, 2022, 10 : 98490 - 98500
  • [2] Group-based siamese self-supervised learning
    Li, Zhongnian
    Wang, Jiayu
    Geng, Qingcong
    Xu, Xinzheng
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (08): : 4913 - 4925
  • [3] Multi-task Self-Supervised Visual Learning
    Doersch, Carl
    Zisserman, Andrew
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2070 - 2079
  • [4] Siamese self-supervised learning for fine-grained visual classification
    Ji, Ruyi
    Li, Jiaying
    Zhang, Libo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
  • [5] SELF-SUPERVISED SPEAKER VERIFICATION WITH SIMPLE SIAMESE NETWORK AND SELF-SUPERVISED REGULARIZATION
    Sang, Mufan
    Li, Haoqi
    Liu, Fang
    Arnold, Andrew O.
    Wan, Li
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6127 - 6131
  • [6] Self-Supervised Visual Representation Learning via Residual Momentum
    Pham, Trung Xuan
    Niu, Axi
    Zhang, Kang
    Jin, Tee Joshua Tian
    Hong, Ji Woo
    Yoo, Chang D.
    IEEE ACCESS, 2023, 11 : 116706 - 116720
  • [7] A Self-Supervised Solution for the Switch-Toggling Visual Task
    Huang, Yuehong
    Tseng, Yu-Chee
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3429 - 3435
  • [8] LADDER SIAMESE NETWORK: A METHOD AND INSIGHTS FOR MULTI-LEVEL SELF-SUPERVISED LEARNING
    Yoshihashi, Ryota
    Nishimura, Shuhei
    Yonebayashi, Dai
    Otsuka, Yuya
    Tanaka, Tomohiro
    Miyazaki, Takashi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 211 - 215
  • [9] Self-Supervised Marine Video Analysis via Siamese Network
    Liang, Ju
    Song, Jihan
    Li, Qianqian
    Shi, Zhensheng
    Gu, Zhaorui
    Zheng, Haiyong
    Zheng, Bing
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [10] S2SiamFC: Self-supervised Fully Convolutional Siamese Network for Visual Tracking
    Sio, Chon Hou
    Ma, Yu-Jen
    Shuai, Hong-Han
    Chen, Jun-Cheng
    Cheng, Wen-Huang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1948 - 1957