Learning What and Where to Learn: A New Perspective on Self-Supervised Learning

被引:4
|
作者
Zhao, Wenyi [1 ]
Yang, Lu [1 ]
Zhang, Weidong [2 ]
Tian, Yongqin [2 ]
Jia, Wenhe [1 ]
Li, Wei [1 ]
Yang, Mu [3 ]
Pan, Xipeng [4 ]
Yang, Huihua [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Henan Inst Sci & Technol, Sch Informat Engn, Xinxiang 453003, Peoples R China
[3] Techmach Beijing Ind Technol Co Ltd, Beijing 102676, Peoples R China
[4] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Feature extraction; Task analysis; Computational modeling; Optimization; Self-supervised learning; Training; learning what; learning where; efficient framework; positional information;
D O I
10.1109/TCSVT.2023.3298937
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Self-supervised learning (SSL) has demonstrated its power in generalized model acquisition by leveraging the discriminative semantic and explicit positional information of unlabeled datasets. Unfortunately, mainstream contrastive learning-based methods excessive focus on semantic information and ignore the position is also the carrier of image content, resulting in inadequate data utilization and extensive computational consumption. To address these issues, we present an efficient SSL framework, learning What and Where to learn ( $\text {W}<^>{2} \text {SSL}$ ), to aggregate semantic and position features. Concretely, we devise a spatially-coupled sampling manner to process images through pre-defined rules, which integrates the advantage of semantic (What) and positional (Where) features into framework to enrich the diversity of feature representation capabilities and improve data utilization. Besides, a spectrum of latent vectors is obtained by mapping the positional features, which implicitly explores the relationship between these vectors. Whereafter, the corresponding discriminative and contrastive optimization objectives are seamlessly embedded in the framework via a cascade paradigm to explore semantic and positional features. The proposed $\text {W}<^>{2} \text {SSL}$ is verified on different types of datasets, which demonstrates that it still outperforms state-of-the-art SSL methods even with half the computational consumption. Code will be available at https://github.com/WilyZhao8/W2SSL.
引用
收藏
页码:6620 / 6633
页数:14
相关论文
共 50 条
  • [21] Gated Self-supervised Learning for Improving Supervised Learning
    Fuadi, Erland Hillman
    Ruslim, Aristo Renaldo
    Wardhana, Putu Wahyu Kusuma
    Yudistira, Novanto
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 611 - 615
  • [22] Graph Self-Supervised Learning: A Survey
    Liu, Yixin
    Jin, Ming
    Pan, Shirui
    Zhou, Chuan
    Zheng, Yu
    Xia, Feng
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5879 - 5900
  • [23] Self-Supervised Learning Across Domains
    Bucci, Silvia
    D'Innocente, Antonio
    Liao, Yujun
    Carlucci, Fabio Maria
    Caputo, Barbara
    Tommasi, Tatiana
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5516 - 5528
  • [24] Self-Supervised Learning in Remote Sensing
    Wang, Yi
    Albrecht, Conrad M.
    Ait Ali Braham, Nassim
    Mou, Lichao
    Zhu, Xiao Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 213 - 247
  • [25] Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey
    Jing, Longlong
    Tian, Yingli
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 4037 - 4058
  • [26] Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
    Cho, Hyeon
    Kim, Taehoon
    Chang, Hyung Jin
    Hwang, Wonjun
    IEEE ACCESS, 2021, 9 : 79562 - 79571
  • [27] Self-Supervised Graphs for Audio Representation Learning With Limited Labeled Data
    Shirian, Amir
    Somandepalli, Krishna
    Guha, Tanaya
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1391 - 1401
  • [28] Classification of Polyps in Endoscopic Images Using Self-Supervised Structured Learning
    Huang, Qi-Xian
    Lin, Guo-Shiang
    Sun, Hung-Min
    IEEE ACCESS, 2023, 11 : 50025 - 50037
  • [29] Self-Supervised Learning for Few-Shot Medical Image Segmentation
    Ouyang, Cheng
    Biffi, Carlo
    Chen, Chen
    Kart, Turkay
    Qiu, Huaqi
    Rueckert, Daniel
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (07) : 1837 - 1848
  • [30] Self-Supervised Learning Malware Traffic Classification Based on Masked Autoencoder
    Xu, Ke
    Zhang, Xixi
    Wang, Yu
    Ohtsuki, Tomoaki
    Adebisi, Bamidele
    Sari, Hikmet
    Gui, Guan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17330 - 17340