Non-Local Color Compensation Network for Intrinsic Image Decomposition

被引:8
作者
Zhang, Feng [1 ]
Jiang, Xiaoyue [1 ]
Xia, Zhaoqiang [1 ]
Gabbouj, Moncef [2 ]
Peng, Jinye [3 ]
Feng, Xiaoyi [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China
[2] Tampere Univ, Dept Comp Sci, Tampere 33101, Finland
[3] Northwest Univ, Sch Informat Sci & Technol, Xian 710069, Peoples R China
关键词
Image color analysis; Feature extraction; Image decomposition; Image reconstruction; Lighting; Task analysis; Decoding; Intrinsic image decomposition; color compensation; multi-scale attention; mutual constraint;
D O I
10.1109/TCSVT.2022.3199428
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Single image-based intrinsic image decomposition attempts to separate one input image into several intrinsic components, which is inherently an under-constrained problem. Some recent works have been proposed to estimate the intrinsic components using encoder-decoder structures. However, they generally lack exploration of the different component-oriented feature constraints and feature selection processes. In this paper, a non-local color compensation network (NCCNet) is proposed. Firstly, the hue and value channels of HSV color space are used as the complementary information for RGB images for the estimation of albedo and shading, respectively. The color space representation serves as an external constraint, which does not require expensive sensors or complicated computations. Secondly, an integrated non-local attention scheme is proposed to describe the relations of non-adjacent regions with a lower computational complexity compared to traditional methods. Then the non-local and local attention are combined to describe correlations among features and used as feature selectors between the encoder and decoder. Thirdly, the mutual constraint between albedo and shading is also explored in the network to further optimize the process. In order to train the network, a unified mutual exclusion loss function is proposed. Extensive experiments are conducted on several popular datasets, and the proposed NCCNet achieves improved performance with comparable computational cost compared to competing methods.
引用
收藏
页码:132 / 145
页数:14
相关论文
共 63 条
  • [1] Barron J. T., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2521, DOI 10.1109/CVPR.2011.5995392
  • [2] Shape, Illumination, and Reflectance from Shading
    Barron, Jonathan T.
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) : 1670 - 1687
  • [3] Intrinsic Scene Properties from a Single RGB-D Image
    Barron, Jonathan T.
    Malik, Jitendra
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 17 - 24
  • [4] Barron JT, 2012, LECT NOTES COMPUT SC, V7575, P57, DOI 10.1007/978-3-642-33765-9_5
  • [5] Barrow H., 1978, Comput. vis. syst, V2, P2
  • [6] Joint Learning of Intrinsic Images and Semantic Segmentation
    Baslamisli, Anil S.
    Groenestege, Thomas T.
    Das, Partha
    Le, Hoang-An
    Karaoglu, Sezer
    Gevers, Theo
    [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 289 - 305
  • [7] Bell M, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P670, DOI 10.1109/ICCV.2001.937585
  • [8] Intrinsic Images in the Wild
    Bell, Sean
    Bala, Kavita
    Snavely, Noah
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04):
  • [9] Bi S, 2018, Arxiv, DOI arXiv:1807.11226
  • [10] Interactive Intrinsic Video Editing
    Bonneel, Nicolas
    Sunkavalli, Kalyan
    Tompkin, James
    Sun, Deqing
    Paris, Sylvain
    Pfister, Hanspeter
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06):