Self-supervised learning for multi-view stereo

被引:0
|
作者
Ito S.
Kaneko N.
Sumi K.
机构
关键词
Deep neural network; Depth estimation; Multi-view stereo; Self-supervised learning;
D O I
10.2493/jjspe.86.1042
中图分类号
学科分类号
摘要
Recent learning-based multi-view stereo (MVS) approaches have shown excellent performance. These approaches typically train a deep neural network to estimate dense depth maps from multiple images. However, most of these approaches require large-scale dense depth maps as the supervisory signals during training. This paper proposes a self-supervised learning framework for MVS, which learns to estimate dense depth maps from multiple images without dense depth supervision. Taking an arbitrary number of images as input, we produce sparse depth maps using structure from motion and use it as self-supervision. We apply reconstraction and smoothness losses to regions where there is no sparse depth. For stable training, we introduce a pseudo-depth loss, which is the difference between depth maps estimated by the network with the current and past parameters. Experimental results on multiple datasets demonstrate the effectiveness of our self-supervised learning framework. © 2020 Japan Society for Precision Engineering. All rights reserved.
引用
收藏
页码:1042 / 1050
页数:8
相关论文
共 50 条
  • [41] Time-Contrastive Networks: Self-Supervised Learning from Multi-View Observation
    Sermanet, Pierre
    Lynch, Corey
    Hsu, Jasmine
    Levine, Sergey
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 486 - 487
  • [42] Multi-view Self-supervised Learning and Multi-scale Feature Fusion for Automatic Speech Recognition
    Zhao, Jingyu
    Li, Ruwei
    Tian, Maocun
    An, Weidong
    NEURAL PROCESSING LETTERS, 2024, 56 (04)
  • [43] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Shuai Bi
    Zhengping Hu
    Mengyao Zhao
    Hehao Zhang
    Jirui Di
    Zhe Sun
    Signal, Image and Video Processing, 2023, 17 : 3775 - 3782
  • [44] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Bi, Shuai
    Hu, Zhengping
    Zhao, Mengyao
    Zhang, Hehao
    Di, Jirui
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
  • [45] Multi-view Contrastive Self-Supervised Learning of Accounting Data Representations for Downstream Audit Tasks
    Schreyer, Marco
    Sattarov, Timur
    Borth, Damian
    ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
  • [46] Self-Supervised Information Bottleneck for Deep Multi-View Subspace Clustering
    Wang, Shiye
    Li, Changsheng
    Li, Yanming
    Yuan, Ye
    Wang, Guoren
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 (1555-1567) : 1555 - 1567
  • [47] Self-Supervised, Multi-View, Semantics-Aware Anchor Clustering
    Wei, Kaibin
    Li, Haifeng
    Liu, Qing
    Zhang, Xiongjian
    ELECTRONICS, 2024, 13 (23):
  • [48] Self-supervised Learning with Multi-view Rendering for 3D Point Cloud Analysis
    Tran, Bach
    Hua, Binh-Son
    Tran, Anh Tuan
    Hoai, Minh
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 413 - 431
  • [49] Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations
    Khaertdinov, Bulat
    Jeuris, Pedro
    Sousa, Annanda
    Hortal, Enrique
    INTERSPEECH 2024, 2024, : 4708 - 4712
  • [50] Semi-Supervised and Self-Supervised Classification with Multi-View Graph Neural Networks
    Yuan, Jinliang
    Yu, Hualei
    Cao, Meng
    Xu, Ming
    Xie, Junyuan
    Wang, Chongjun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2466 - 2476