L-CoIns: Language-based Colorization with Instance Awareness

被引:5
作者
Chang, Zheng [1 ]
Weng, Shuchen [2 ,3 ]
Zhang, Peixuan [1 ]
Li, Yu [4 ]
Li, Si [1 ]
Shi, Boxin [2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[3] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing, Peoples R China
[4] Int Digital Econ Acad, Shenzhen, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/CVPR52729.2023.01842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language-based colorization produces plausible colors consistent with the language description provided by the user. Recent studies introduce additional annotation to prevent color-object coupling and mismatch issues, but they still have difficulty in distinguishing instances corresponding to the same object words. In this paper, we propose a transformer-based framework to automatically aggregate similar image patches and achieve instance awareness without any additional knowledge. By applying our presented luminance augmentation and counter-color loss to break down the statistical correlation between luminance and color words, our model is driven to synthesize colors with better descriptive consistency. We further collect a dataset to provide distinctive visual characteristics and detailed language descriptions for multiple instances in the same image. Extensive experiments demonstrate our advantages of synthesizing visually pleasing and descriptionconsistent results of instance-aware colorization.
引用
收藏
页码:19221 / 19230
页数:10
相关论文
共 56 条
  • [1] [Anonymous], 2017, NIPS
  • [2] Ardizzone Lynton, 2019, ARXIV190702392
  • [3] Bai Yunpeng, 2022, ECCV
  • [4] COCO-Stuff: Thing and Stuff Classes in Context
    Caesar, Holger
    Uijlings, Jasper
    Ferrari, Vittorio
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1209 - 1218
  • [5] Cao Y., 2017, ECML PKDD
  • [6] Carion N., 2020, P EUR C COMP VIS GLA, P213, DOI DOI 10.1007/978-3-030-58452-813
  • [7] Chang Zheng, 2022, ECCV
  • [8] Chen Hansheng, 2021, CVPR
  • [9] Image Blind Denoising With Generative Adversarial Network Based Noise Modeling
    Chen, Jingwen
    Chen, Jiawei
    Chao, Hongyang
    Yang, Ming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3155 - 3164
  • [10] Deep Colorization
    Cheng, Zezhou
    Yang, Qingxiong
    Sheng, Bin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 415 - 423