Video Inverse Tone Mapping Network with Luma and Chroma Mapping

被引:2
作者
Huang, Peihuan [1 ]
Cao, Gaofeng [2 ,3 ]
Zhou, Fei [1 ,3 ,4 ]
Qiu, Guoping [1 ,5 ,6 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[2] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[5] Univ Nottingham, Sch Comp Sci, Nottingham, England
[6] Guangdong Hong Kong Joint Lab Big Data Imaging &, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金;
关键词
Inverse Tone Mapping; Perceptual Uniformity; ICTCP color space;
D O I
10.1145/3581783.3612199
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the popularity of consumer high dynamic range (HDR) display devices, video inverse tone mapping (iTM) has become a research hotspot. However, existing methods are designed based on a perceptual non-uniformity color space (e.g., RGB and TCBCR), resulting in limited quality of HDR video rendered by these methods. Considering the two key factors involved in the video iTM task: luma and chroma, in this paper, we design an ICTCP color space based video iTM model, which reproduces high quality HDR video by processing luma and chroma information. Benefitting from the decorrelated perception of luma and chroma in the ICTCP color space, two global mapping networks (INet and TPNet) are developed to enhance the luma and chroma pixels, respectively. However, luma and chroma mapping in the iTM task may be affected by color appearance phenomena. Thus, a luma-chroma adaptation transform network (LCATNet) is proposed to process the luma and chroma pixels affected by color appearance phenomena, which can complement the local details to the globally enhanced luma and chroma pixels. In the LCATNet, either the luma mapping or the chroma mapping is adaptively adjusted according to both the luma and the chroma information. Besides, benefitting from the perceptually consistent property of the ICTCP color space, the same pixel errors can draw equal model attentions during the training. Thus, the proposed model can correctly render luma and chroma information without highlighting special regions or designing special training losses. Extensive experimental results demonstrate the effectiveness of the proposed model.
引用
收藏
页码:1383 / 1391
页数:9
相关论文
共 39 条
  • [1] [Anonymous], 2019, OBJ METR ASS POT VIS
  • [2] Cao GF, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P806
  • [3] A Decoupled Kernel Prediction Network Guided by Soft Mask for Single Image HDR Reconstruction
    Cao, Gaofeng
    Zhou, Fei
    Liu, Kanglin
    Wang, Anjie
    Fan, Leidong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [4] A brightness-adaptive kernel prediction network for inverse tone mapping
    Cao, Gaofeng
    Zhou, Fei
    Liu, Kanglin
    Liu, Bozhi
    [J]. NEUROCOMPUTING, 2021, 464 : 1 - 14
  • [5] Chen G., 2021, ICCV, P2502
  • [6] A New Journey from SDRTV to HDRTV
    Chen, Xiangyu
    Zhang, Zhengwen
    Ren, Jimmy S.
    Tian, Lynhoo
    Qiao, Yu
    Dong, Chao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4480 - 4489
  • [7] HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization
    Chen, Xiangyu
    Liu, Yihao
    Zhang, Zhengwen
    Qiao, Yu
    Dong, Chao
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 354 - 363
  • [8] Study on Panel Sharpening in Different Color Spaces
    Dai, Min
    Huang, Ai-Mei
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVI, 2013, 8856
  • [9] Dolby, 2016, ICTCP DOLB WHIT PAP
  • [10] Learning a Deep Convolutional Network for Image Super-Resolution
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199