Non-Local Color Compensation Network for Intrinsic Image Decomposition

被引：8

作者：

Zhang, Feng ^{[1
]}

Jiang, Xiaoyue ^{[1
]}

Xia, Zhaoqiang ^{[1
]}

Gabbouj, Moncef ^{[2
]}

Peng, Jinye ^{[3
]}

Feng, Xiaoyi ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China

[2] Tampere Univ, Dept Comp Sci, Tampere 33101, Finland

[3] Northwest Univ, Sch Informat Sci & Technol, Xian 710069, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023年 / 33卷 / 01期

关键词：

Image color analysis; Feature extraction; Image decomposition; Image reconstruction; Lighting; Task analysis; Decoding; Intrinsic image decomposition; color compensation; multi-scale attention; mutual constraint;

D O I：

10.1109/TCSVT.2022.3199428

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Single image-based intrinsic image decomposition attempts to separate one input image into several intrinsic components, which is inherently an under-constrained problem. Some recent works have been proposed to estimate the intrinsic components using encoder-decoder structures. However, they generally lack exploration of the different component-oriented feature constraints and feature selection processes. In this paper, a non-local color compensation network (NCCNet) is proposed. Firstly, the hue and value channels of HSV color space are used as the complementary information for RGB images for the estimation of albedo and shading, respectively. The color space representation serves as an external constraint, which does not require expensive sensors or complicated computations. Secondly, an integrated non-local attention scheme is proposed to describe the relations of non-adjacent regions with a lower computational complexity compared to traditional methods. Then the non-local and local attention are combined to describe correlations among features and used as feature selectors between the encoder and decoder. Thirdly, the mutual constraint between albedo and shading is also explored in the network to further optimize the process. In order to train the network, a unified mutual exclusion loss function is proposed. Extensive experiments are conducted on several popular datasets, and the proposed NCCNet achieves improved performance with comparable computational cost compared to competing methods.

引用

页码：132 / 145

页数：14

共 63 条

[1] Barron J. T., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2521, DOI 10.1109/CVPR.2011.5995392
[2] Shape, Illumination, and Reflectance from Shading
Barron, Jonathan T.
Malik, Jitendra
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) : 1670 - 1687
[3] Intrinsic Scene Properties from a Single RGB-D Image
Barron, Jonathan T.
Malik, Jitendra
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 17 - 24
[4] Barron JT, 2012, LECT NOTES COMPUT SC, V7575, P57, DOI 10.1007/978-3-642-33765-9_5
[5] Barrow H., 1978, Comput. vis. syst, V2, P2
[6] Joint Learning of Intrinsic Images and Semantic Segmentation
Baslamisli, Anil S.
Groenestege, Thomas T.
Das, Partha
Le, Hoang-An
Karaoglu, Sezer
Gevers, Theo
[J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 289 - 305
[7] Bell M, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P670, DOI 10.1109/ICCV.2001.937585
[8] Intrinsic Images in the Wild
Bell, Sean
Bala, Kavita
Snavely, Noah
[J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04):
[9] Bi S, 2018, Arxiv, DOI arXiv:1807.11226
[10] Interactive Intrinsic Video Editing
Bonneel, Nicolas
Sunkavalli, Kalyan
Tompkin, James
Sun, Deqing
Paris, Sylvain
Pfister, Hanspeter
[J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06):

← 1 2 3 4 5 6 7 →