Disentangling Light Fields for Super-Resolution and Disparity Estimation

Cited by: 141
Authors
Wang, Yingqian [1]
Wang, Longguang [1]
Wu, Gaochang [2]
Yang, Jungang [1]
An, Wei [1]
Yu, Jingyi [3]
Guo, Yulan [1]
Affiliations
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Hunan, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, Pudong 201210, Peoples R China
Keywords
Light field image processing; feature disentangling; image super-resolution; view synthesis; disparity estimation; EPIPOLAR GEOMETRY; NETWORK; DEPTH; SHAPE
DOI
10.1109/TPAMI.2022.3152488
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Light field (LF) cameras record both the intensity and the direction of light rays, and encode 3D scenes into 4D LF images. Recently, many convolutional neural networks (CNNs) have been proposed for various LF image processing tasks. However, it is challenging for CNNs to process LF images effectively because the spatial and angular information are highly intertwined with varying disparities. In this paper, we propose a generic mechanism to disentangle this coupled information for LF image processing. Specifically, we first design a class of domain-specific convolutions to disentangle LFs from different dimensions, and then leverage these disentangled features by designing task-specific modules. Our disentangling mechanism incorporates the LF structure prior well and handles 4D LF data effectively. Based on the proposed mechanism, we develop three networks (i.e., DistgSSR, DistgASR and DistgDisp) for spatial super-resolution, angular super-resolution and disparity estimation. Experimental results show that our networks achieve state-of-the-art performance on all three tasks, which demonstrates the effectiveness, efficiency, and generality of our disentangling mechanism. Project page: https://yingqianwang.github.io/DistgLF/.
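
The abstract does not spell out how the spatial and angular dimensions are separated, so the following is only an illustrative sketch, not the authors' implementation. It assumes a 4D LF stored as an array of shape (U, V, H, W) and a macro-pixel layout in which each U x V block holds every angular sample of one spatial location; the function names `lf_to_macpi`, `spatial_view` and `angular_patch` are hypothetical. Under those assumptions, strided sampling isolates spatial information (one view) while block slicing isolates angular information (one scene point across views).

```python
import numpy as np

def lf_to_macpi(lf):
    """Rearrange a light field of shape (U, V, H, W) into a macro-pixel image (H*U, W*V)."""
    U, V, H, W = lf.shape
    # Put each angular axis next to its spatial axis, (H, U, W, V), then merge pairs.
    return lf.transpose(2, 0, 3, 1).reshape(H * U, W * V)

def spatial_view(macpi, U, V, u, v):
    """Spatial information: view (u, v), sampled with a stride equal to the angular size."""
    return macpi[u::U, v::V]

def angular_patch(macpi, U, V, h, w):
    """Angular information: the U x V macro-pixel of spatial location (h, w)."""
    return macpi[h * U:(h + 1) * U, w * V:(w + 1) * V]

# Toy light field (assumed sizes): 2 x 2 views, 3 x 4 pixels per view.
lf = np.arange(2 * 2 * 3 * 4).reshape(2, 2, 3, 4)
macpi = lf_to_macpi(lf)

view = spatial_view(macpi, 2, 2, 1, 0)    # equals lf[1, 0]
patch = angular_patch(macpi, 2, 2, 2, 3)  # equals lf[:, :, 2, 3]
```

In a CNN, the same two access patterns would roughly correspond to a convolution whose dilation equals the angular size (spatial) and a convolution whose kernel and stride equal the angular size (angular); that mapping is stated here as an assumption rather than taken from the abstract.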
Pages: 425-443
Number of pages: 19
Related papers
82 in total
  • [51] Depth from Combining Defocus and Correspondence Using Light-Field Cameras
    Tao, Michael W.
    Hadap, Sunil
    Malik, Jitendra
    Ramamoorthi, Ravi
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 673 - 680
  • [52] Tsai YJ, 2020, AAAI CONF ARTIF INTE, V34, P12095
  • [53] Light Field Reconstruction Using Shearlet Transform
    Vagharshakyan, Suren
    Bregovic, Robert
    Gotchev, Atanas
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 133 - 147
  • [54] Vaish V., 2008, Computer Graphics Laboratory, Stanford University, V6
  • [55] SVBRDF-Invariant Shape and Reflectance Estimation from a Light-Field Camera
    Wang, Ting-Chun
    Chandraker, Manmohan
    Efros, Alexei A.
    Ramamoorthi, Ravi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (03) : 740 - 754
  • [56] Depth Estimation with Occlusion Modeling Using Light-Field Cameras
    Wang, Ting-Chun
    Efros, Alexei A.
    Ramamoorthi, Ravi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (11) : 2170 - 2181
  • [57] Wang XP, 2018, IDEAS HIST MOD CHINA, V19, P1, DOI 10.1163/9789004385580_002
  • [58] Wang YA, 2020, IEEE WINT CONF APPL, P118, DOI 10.1109/WACV45572.2020.9093448
  • [59] Light Field Image Super-Resolution Using Deformable Convolution
    Wang, Yingqian
    Yang, Jungang
    Wang, Longguang
    Ying, Xinyi
    Wu, Tianhao
    An, Wei
    Guo, Yulan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1057 - 1071
  • [60] Selective Light Field Refocusing for Camera Arrays Using Bokeh Rendering and Superresolution
    Wang, Yingqian
    Yang, Jungang
    Guo, Yulan
    Xiao, Chao
    An, Wei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (01) : 204 - 208