Cross Pixel Optical-Flow Similarity for Self-supervised Learning

被引:24
作者
Mahendran, Aravindh [1 ]
Thewlis, James [1 ]
Vedaldi, Andrea [1 ]
机构
[1] Univ Oxford, Visual Geometry Grp, Oxford, England
来源
COMPUTER VISION - ACCV 2018, PT V | 2019年 / 11365卷
基金
英国工程与自然科学研究理事会;
关键词
Self-supervised learning; Motion; Convolutional neural network;
D O I
10.1007/978-3-030-20873-8_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel method for learning convolutional neural image representations without manual supervision. We use motion cues in the form of optical-flow, to supervise representations of static images. The obvious approach of training a network to predict flow from a single image can be needlessly difficult due to intrinsic ambiguities in this prediction task. We instead propose a much simpler learning goal: embed pixels such that the similarity between their embeddings matches that between their optical-flow vectors. At test time, the learned deep network can be used without access to video or flow information and transferred to tasks such as image classification, detection, and segmentation. Our method, which significantly simplifies previous attempts at using motion for self-supervision, achieves state-of-the-art results in self-supervision using motion cues, and is overall state of the art in self-supervised pre-training for semantic image segmentation, as demonstrated on standard benchmarks.
引用
收藏
页码:99 / 116
页数:18
相关论文
共 50 条
[41]   Self-Supervised Multimodal Learning: A Survey [J].
Zong, Yongshuo ;
Aodha, Oisin Mac ;
Hospedales, Timothy M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (07) :5299-5318
[42]   Self-supervised Learning for CT Deconvolution [J].
Sudhakar, Prasad ;
Langoju, Rajesh ;
Agrawal, Utkarsh ;
Patil, Bhushan D. ;
Narayanan, Ajay ;
Chaugule, Vinay ;
Amilneni, Vinod ;
Cheerankal, Paul ;
Das, Bipul .
MEDICAL IMAGING 2021: PHYSICS OF MEDICAL IMAGING, 2021, 11595
[43]   Self-Supervised Learning for User Localization [J].
Dash, Ankan ;
Gu, Jingyi ;
Wang, Guiling ;
Ansari, Nirwan .
2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, :886-890
[44]   A Survey on Contrastive Self-Supervised Learning [J].
Jaiswal, Ashish ;
Babu, Ashwin Ramesh ;
Zadeh, Mohammad Zaki ;
Banerjee, Debapriya ;
Makedon, Fillia .
TECHNOLOGIES, 2021, 9 (01)
[45]   Self-Supervised learning for Conversational Recommendation [J].
Li, Shuokai ;
Xie, Ruobing ;
Zhu, Yongchun ;
Zhuang, Fuzhen ;
Tang, Zhenwei ;
Zhao, Wayne Xin ;
He, Qing .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)
[46]   Self-supervised learning with ensemble representations [J].
Han, Kyoungmin ;
Lee, Minsik .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143
[47]   Synergistic Self-supervised and Quantization Learning [J].
Cao, Yun-Hao ;
Sun, Peiqin ;
Huang, Yechang ;
Wu, Jianxin ;
Zhou, Shuchang .
COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 :587-604
[48]   Self-Supervised Saliency Estimation for Pixel Embedding in Road Detection [J].
Zhou, Di ;
Tian, Yan ;
Chen, Wei-Gang ;
Huang, Gang .
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) :1325-1329
[49]   Learning pose regression as reliable pixel-level matching for self-supervised depth estimation [J].
He, Junwen ;
Wang, Yifan ;
Wang, Lijun ;
Lu, Huchuan .
NEUROCOMPUTING, 2025, 638
[50]   Self-Adaptive Training: Bridging Supervised and Self-Supervised Learning [J].
Huang, Lang ;
Zhang, Chao ;
Zhang, Hongyang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) :1362-1377