Contrastive Self-supervised Representation Learning Using Synthetic Data

被引:0
作者
Dong-Yu She
Kun Xu
机构
[1] Tsinghua University,Beijing National Research Center for Information Science and Technology, Department of Computer Science and Technology
来源
International Journal of Automation and Computing | 2021年 / 18卷
关键词
Self-supervised learning; contrastive learning; synthetic image; convolutional neural network; representation learning;
D O I
暂无
中图分类号
学科分类号
摘要
Learning discriminative representations with deep neural networks often relies on massive labeled data, which is expensive and difficult to obtain in many real scenarios. As an alternative, self-supervised learning that leverages input itself as supervision is strongly preferred for its soaring performance on visual representation learning. This paper introduces a contrastive self-supervised framework for learning generalizable representations on the synthetic data that can be obtained easily with complete controllability. Specifically, we propose to optimize a contrastive learning task and a physical property prediction task simultaneously. Given the synthetic scene, the first task aims to maximize agreement between a pair of synthetic images generated by our proposed view sampling module, while the second task aims to predict three physical property maps, i.e., depth, instance contour maps, and surface normal maps. In addition, a feature-level domain adaptation technique with adversarial training is applied to reduce the domain difference between the realistic and the synthetic data. Experiments demonstrate that our proposed method achieves state-of-the-art performance on several visual recognition datasets.
引用
收藏
页码:556 / 567
页数:11
相关论文
共 30 条
[1]  
Zhao B(2017)A survey on deep learning-based fine-grained object classification and semantic segmentation International Journal of Automation and Computing 14 119-135
[2]  
Feng J S(2019)Deep learning based single image super-resolution: A survey International Journal of Automation and Computing 16 413-426
[3]  
Wu X(2020)Localization and classification of rice-grain images using region proposals-based convolutional neural network International Journal of Automation and Computing 17 233-246
[4]  
Yan S(2006)Reducing the dimensionality of data with neural networks Science 313 504-507
[5]  
Ha V K(2017)Holistically-nested edge detection International Journal of Computer Vision 125 3-18
[6]  
Ren J C(2018)Places: A 10 million image database for scene recognition IEEE Transactions on Pattern Analysis and Machine Intelligence 40 1452-1464
[7]  
Xu X Y(2015)The pascal visual object classes challenge: A retrospective International Journal of Computer Vision 111 98-136
[8]  
Zhao S(undefined)undefined undefined undefined undefined-undefined
[9]  
Xie G(undefined)undefined undefined undefined undefined-undefined
[10]  
Masero V(undefined)undefined undefined undefined undefined-undefined