Focal Frequency Loss for Image Reconstruction and Synthesis

被引:218
作者
Jiang, Liming [1 ]
Dai, Bo [1 ]
Wu, Wayne [2 ]
Loy, Chen Change [1 ]
机构
[1] Nanyang Technol Univ, S Lab, Singapore, Singapore
[2] SenseTime Res, Hong Kong, Peoples R China
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.01366
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image reconstruction and synthesis have witnessed remarkable progress thanks to the development of generative models. Nonetheless, gaps could still exist between the real and generated images, especially in the frequency domain. In this study, we show that narrowing gaps in the frequency domain can ameliorate image reconstruction and synthesis quality further. We propose a novel focal frequency loss, which allows a model to adaptively focus on frequency components that are hard to synthesize by down-weighting the easy ones. This objective function is complementary to existing spatial losses, offering great impedance against the loss of important frequency information due to the inherent bias of neural networks. We demonstrate the versatility and effectiveness of focal frequency loss to improve popular models, such as VAE, pix2pix, and SPADE, in both perceptual quality and quantitative performance. We further show its potential on StyleGAN2.1,
引用
收藏
页码:13899 / 13909
页数:11
相关论文
共 72 条
[1]  
[Anonymous], 2014, CVPR, DOI DOI 10.1109/CVPR.2014.32
[2]  
[Anonymous], 2020, ICML
[3]  
[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01249-611
[4]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00492
[5]  
[Anonymous], 2019, ICML
[6]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00244
[7]  
Brock Andrew, 2018, INT C LEARN REPR
[8]  
Cai Mu, 2020, ARXIV201113611
[9]   Photographic Image Synthesis with Cascaded Refinement Networks [J].
Chen, Qifeng ;
Koltun, Vladlen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1520-1529
[10]  
Chen W, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), P22, DOI 10.1109/RCAR.2016.7783995