GRAY-SCALE IMAGE COLORIZATION USING CYCLE-CONSISTENT GENERATIVE ADVERSARIAL NETWORKS WITH RESIDUAL STRUCTURE ENHANCER

被引:0
作者
Johari, Mohammad Mahdi [1 ]
Behroozi, Hamid [1 ]
机构
[1] Sharif Univ Technol, Elect Engn Dept, Tehran, Iran
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
Cycle-Consistency; Generative Adversarial Networks; Image Colorization; Residual Structure;
D O I
10.1109/icassp40776.2020.9054432
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The colorization of gray-scale images has always been a challenging task in computer vision. Recently, novel approaches have been introduced for unsupervised image translation between two domains using Generative Adversarial Networks (GANs). Since one can consider the gray-scale and colorful images as two separate domains, we propose a two-stage cycle-consistent network architecture to produce convincible images. First, an intermediate image is generated with a relatively uncomplicated objective function at the output. Next, at the second stage, the intermediate image is enhanced via a residual network structure with a more complicated objective function. Furthermore, by employing two inverse networks, a cycle-consistent architecture is formed at both stages. The proposed model is trained on the ImageNet dataset, and the achieved outcomes demonstrate exceptional performance comparing with the state-of-the-art models.
引用
收藏
页码:2223 / 2227
页数:5
相关论文
共 28 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], P IEEE INT C COMP VI
[3]  
[Anonymous], 2015, Tech. Rep.
[4]   Recent Advances of Generative Adversarial Networks in Computer Vision [J].
Cao, Yang-Jie ;
Jia, Li-Li ;
Chen, Yong-Xia ;
Lin, Nan ;
Yang, Cong ;
Zhang, Bo ;
Liu, Zhi ;
Li, Xue-Xiang ;
Dai, Hong-Hua .
IEEE ACCESS, 2019, 7 :14985-15006
[5]   Unsupervised Diverse Colorization via Generative Adversarial Networks [J].
Cao, Yun ;
Zhou, Zhiming ;
Zhang, Weinan ;
Yu, Yong .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I, 2017, 10534 :151-166
[6]  
Chen ZH, 2015, ACSR ADV COMPUT, P123
[7]   Sem-GAN: Semantically-Consistent Image-to-Image Translation [J].
Cherian, Anoop ;
Sullivan, Alan .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1797-1806
[8]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[9]  
He K, 2016, PROC CVPR IEEE, P770, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
[10]  
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167