L-CoIns: Language-based Colorization with Instance Awareness

被引：5

作者：

Chang, Zheng ^{[1
]}

Weng, Shuchen ^{[2
,3
]}

Zhang, Peixuan ^{[1
]}

Li, Yu ^{[4
]}

Li, Si ^{[1
]}

Shi, Boxin ^{[2
,3
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China

[2] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China

[3] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing, Peoples R China

[4] Int Digital Econ Acad, Shenzhen, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

10.1109/CVPR52729.2023.01842

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Language-based colorization produces plausible colors consistent with the language description provided by the user. Recent studies introduce additional annotation to prevent color-object coupling and mismatch issues, but they still have difficulty in distinguishing instances corresponding to the same object words. In this paper, we propose a transformer-based framework to automatically aggregate similar image patches and achieve instance awareness without any additional knowledge. By applying our presented luminance augmentation and counter-color loss to break down the statistical correlation between luminance and color words, our model is driven to synthesize colors with better descriptive consistency. We further collect a dataset to provide distinctive visual characteristics and detailed language descriptions for multiple instances in the same image. Extensive experiments demonstrate our advantages of synthesizing visually pleasing and descriptionconsistent results of instance-aware colorization.

引用

页码：19221 / 19230

页数：10

共 56 条

[1] [Anonymous], 2017, NIPS
[2] Ardizzone Lynton, 2019, ARXIV190702392
[3] Bai Yunpeng, 2022, ECCV
[4] COCO-Stuff: Thing and Stuff Classes in Context
Caesar, Holger
Uijlings, Jasper
Ferrari, Vittorio
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1209 - 1218
[5] Cao Y., 2017, ECML PKDD
[6] Carion N., 2020, P EUR C COMP VIS GLA, P213, DOI DOI 10.1007/978-3-030-58452-813
[7] Chang Zheng, 2022, ECCV
[8] Chen Hansheng, 2021, CVPR
[9] Image Blind Denoising With Generative Adversarial Network Based Noise Modeling
Chen, Jingwen
Chen, Jiawei
Chao, Hongyang
Yang, Ming
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3155 - 3164
[10] Deep Colorization
Cheng, Zezhou
Yang, Qingxiong
Sheng, Bin
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 415 - 423

← 1 2 3 4 5 6 →