L-CoIns: Language-based Colorization with Instance Awareness

被引:5
作者
Chang, Zheng [1 ]
Weng, Shuchen [2 ,3 ]
Zhang, Peixuan [1 ]
Li, Yu [4 ]
Li, Si [1 ]
Shi, Boxin [2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[3] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing, Peoples R China
[4] Int Digital Econ Acad, Shenzhen, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/CVPR52729.2023.01842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language-based colorization produces plausible colors consistent with the language description provided by the user. Recent studies introduce additional annotation to prevent color-object coupling and mismatch issues, but they still have difficulty in distinguishing instances corresponding to the same object words. In this paper, we propose a transformer-based framework to automatically aggregate similar image patches and achieve instance awareness without any additional knowledge. By applying our presented luminance augmentation and counter-color loss to break down the statistical correlation between luminance and color words, our model is driven to synthesize colors with better descriptive consistency. We further collect a dataset to provide distinctive visual characteristics and detailed language descriptions for multiple instances in the same image. Extensive experiments demonstrate our advantages of synthesizing visually pleasing and descriptionconsistent results of instance-aware colorization.
引用
收藏
页码:19221 / 19230
页数:10
相关论文
共 56 条
  • [41] Weng S., 2022, AAAI
  • [42] Weng Shuchen, 2022, ECCV
  • [43] Xia M., 2022, TOG
  • [44] Xie Yanping, 2018, THESIS
  • [45] An Experimental Study of Characteristics of Solitary-Wave-Induced Scour Around a Pile Breakwater with a Discussion on Effects of the Distance Between Piles
    Xu, Conghao
    Huang, Zhenhua
    [J]. JOURNAL OF EARTHQUAKE AND TSUNAMI, 2022, 16 (03)
  • [46] Xu J., 2022, CVPR
  • [47] Xu ZY, 2020, PROC CVPR IEEE, P9360, DOI 10.1109/CVPR42600.2020.00938
  • [48] Yang Zizheng, 2022, CVPR
  • [49] The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
    Zhang, Richard
    Isola, Phillip
    Efros, Alexei A.
    Shechtman, Eli
    Wang, Oliver
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 586 - 595
  • [50] Colorful Image Colorization
    Zhang, Richard
    Isola, Phillip
    Efros, Alexei A.
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 649 - 666