Video Inverse Tone Mapping Network with Luma and Chroma Mapping

被引：2

作者：

Huang, Peihuan ^{[1
]}

Cao, Gaofeng ^{[2
,3
]}

Zhou, Fei ^{[1
,3
,4
]}

Qiu, Guoping ^{[1
,5
,6
]}

机构：

[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China

[2] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China

[3] Peng Cheng Lab, Shenzhen, Peoples R China

[4] Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China

[5] Univ Nottingham, Sch Comp Sci, Nottingham, England

[6] Guangdong Hong Kong Joint Lab Big Data Imaging &, Hong Kong, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

中国国家自然科学基金;

关键词：

Inverse Tone Mapping; Perceptual Uniformity; ICTCP color space;

D O I：

10.1145/3581783.3612199

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the popularity of consumer high dynamic range (HDR) display devices, video inverse tone mapping (iTM) has become a research hotspot. However, existing methods are designed based on a perceptual non-uniformity color space (e.g., RGB and TCBCR), resulting in limited quality of HDR video rendered by these methods. Considering the two key factors involved in the video iTM task: luma and chroma, in this paper, we design an ICTCP color space based video iTM model, which reproduces high quality HDR video by processing luma and chroma information. Benefitting from the decorrelated perception of luma and chroma in the ICTCP color space, two global mapping networks (INet and TPNet) are developed to enhance the luma and chroma pixels, respectively. However, luma and chroma mapping in the iTM task may be affected by color appearance phenomena. Thus, a luma-chroma adaptation transform network (LCATNet) is proposed to process the luma and chroma pixels affected by color appearance phenomena, which can complement the local details to the globally enhanced luma and chroma pixels. In the LCATNet, either the luma mapping or the chroma mapping is adaptively adjusted according to both the luma and the chroma information. Besides, benefitting from the perceptually consistent property of the ICTCP color space, the same pixel errors can draw equal model attentions during the training. Thus, the proposed model can correctly render luma and chroma information without highlighting special regions or designing special training losses. Extensive experimental results demonstrate the effectiveness of the proposed model.

引用

页码：1383 / 1391

页数：9

共 39 条

[1] [Anonymous], 2019, OBJ METR ASS POT VIS
[2] Cao GF, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P806
[3] A Decoupled Kernel Prediction Network Guided by Soft Mask for Single Image HDR Reconstruction
Cao, Gaofeng
Zhou, Fei
Liu, Kanglin
Wang, Anjie
Fan, Leidong
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
[4] A brightness-adaptive kernel prediction network for inverse tone mapping
Cao, Gaofeng
Zhou, Fei
Liu, Kanglin
Liu, Bozhi
[J]. NEUROCOMPUTING, 2021, 464 : 1 - 14
[5] Chen G., 2021, ICCV, P2502
[6] A New Journey from SDRTV to HDRTV
Chen, Xiangyu
Zhang, Zhengwen
Ren, Jimmy S.
Tian, Lynhoo
Qiao, Yu
Dong, Chao
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4480 - 4489
[7] HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization
Chen, Xiangyu
Liu, Yihao
Zhang, Zhengwen
Qiao, Yu
Dong, Chao
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 354 - 363
[8] Study on Panel Sharpening in Different Color Spaces
Dai, Min
Huang, Ai-Mei
[J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVI, 2013, 8856
[9] Dolby, 2016, ICTCP DOLB WHIT PAP
[10] Learning a Deep Convolutional Network for Image Super-Resolution
Dong, Chao
Loy, Chen Change
He, Kaiming
Tang, Xiaoou
[J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199

← 1 2 3 4 →