Learning What and Where to Learn: A New Perspective on Self-Supervised Learning

被引：4

作者：

Zhao, Wenyi ^{[1
]}

Yang, Lu ^{[1
]}

Zhang, Weidong ^{[2
]}

Tian, Yongqin ^{[2
]}

Jia, Wenhe ^{[1
]}

Li, Wei ^{[1
]}

Yang, Mu ^{[3
]}

Pan, Xipeng ^{[4
]}

Yang, Huihua ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] Henan Inst Sci & Technol, Sch Informat Engn, Xinxiang 453003, Peoples R China

[3] Techmach Beijing Ind Technol Co Ltd, Beijing 102676, Peoples R China

[4] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Semantics; Feature extraction; Task analysis; Computational modeling; Optimization; Self-supervised learning; Training; learning what; learning where; efficient framework; positional information;

D O I：

10.1109/TCSVT.2023.3298937

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Self-supervised learning (SSL) has demonstrated its power in generalized model acquisition by leveraging the discriminative semantic and explicit positional information of unlabeled datasets. Unfortunately, mainstream contrastive learning-based methods excessive focus on semantic information and ignore the position is also the carrier of image content, resulting in inadequate data utilization and extensive computational consumption. To address these issues, we present an efficient SSL framework, learning What and Where to learn ( $\text {W}<^>{2} \text {SSL}$ ), to aggregate semantic and position features. Concretely, we devise a spatially-coupled sampling manner to process images through pre-defined rules, which integrates the advantage of semantic (What) and positional (Where) features into framework to enrich the diversity of feature representation capabilities and improve data utilization. Besides, a spectrum of latent vectors is obtained by mapping the positional features, which implicitly explores the relationship between these vectors. Whereafter, the corresponding discriminative and contrastive optimization objectives are seamlessly embedded in the framework via a cascade paradigm to explore semantic and positional features. The proposed $\text {W}<^>{2} \text {SSL}$ is verified on different types of datasets, which demonstrates that it still outperforms state-of-the-art SSL methods even with half the computational consumption. Code will be available at https://github.com/WilyZhao8/W2SSL.

引用

页码：6620 / 6633

页数：14

共 50 条

[21] Gated Self-supervised Learning for Improving Supervised Learning
Fuadi, Erland Hillman
Ruslim, Aristo Renaldo
Wardhana, Putu Wahyu Kusuma
Yudistira, Novanto
2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 611 - 615
[22] Graph Self-Supervised Learning: A Survey
Liu, Yixin
Jin, Ming
Pan, Shirui
Zhou, Chuan
Zheng, Yu
Xia, Feng
Yu, Philip S.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5879 - 5900
[23] Self-Supervised Learning Across Domains
Bucci, Silvia
D'Innocente, Antonio
Liao, Yujun
Carlucci, Fabio Maria
Caputo, Barbara
Tommasi, Tatiana
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5516 - 5528
[24] Self-Supervised Learning in Remote Sensing
Wang, Yi
Albrecht, Conrad M.
Ait Ali Braham, Nassim
Mou, Lichao
Zhu, Xiao Xiang
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 213 - 247
[25] Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey
Jing, Longlong
Tian, Yingli
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 4037 - 4058
[26] Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
Cho, Hyeon
Kim, Taehoon
Chang, Hyung Jin
Hwang, Wonjun
IEEE ACCESS, 2021, 9 : 79562 - 79571
[27] Self-Supervised Graphs for Audio Representation Learning With Limited Labeled Data
Shirian, Amir
Somandepalli, Krishna
Guha, Tanaya
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1391 - 1401
[28] Classification of Polyps in Endoscopic Images Using Self-Supervised Structured Learning
Huang, Qi-Xian
Lin, Guo-Shiang
Sun, Hung-Min
IEEE ACCESS, 2023, 11 : 50025 - 50037
[29] Self-Supervised Learning for Few-Shot Medical Image Segmentation
Ouyang, Cheng
Biffi, Carlo
Chen, Chen
Kart, Turkay
Qiu, Huaqi
Rueckert, Daniel
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (07) : 1837 - 1848
[30] Self-Supervised Learning Malware Traffic Classification Based on Masked Autoencoder
Xu, Ke
Zhang, Xixi
Wang, Yu
Ohtsuki, Tomoaki
Adebisi, Bamidele
Sari, Hikmet
Gui, Guan
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17330 - 17340

← 1 2 3 4 5 →