Hypercomplex Image-to-Image Translation

被引:7
作者
Grassucci, Eleonora [1 ]
Sigillo, Luigi [1 ]
Uncini, Aurelio [1 ]
Comminiello, Danilo [1 ]
机构
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun DIET, Rome, Italy
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
关键词
Hypercomplex Neural Networks; Generative Adversarial Networks; Image-to-Image Translation; Lightweight Models; CONVOLUTIONAL NEURAL-NETWORKS; QUATERNION;
D O I
10.1109/IJCNN55064.2022.9892119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-to-image translation (I2I) aims at transferring the content representation from an input domain to an output one, bouncing along different target domains. Recent I2I generative models, which gain outstanding results in this task, comprise a set of diverse deep networks each with tens of million parameters. Moreover, images are usually three-dimensional being composed of RGB channels and common neural models do not take dimensions correlation into account, losing beneficial information. In this paper, we propose to leverage hypercomplex algebra properties to define lightweight I2I generative models capable of preserving pre-existing relations among image dimensions, thus exploiting additional input information. On manifold I2I benchmarks, we show how the proposed Quaternion StarGANv2 and parameterized hypercomplex StarGANv2 (PHStarGANv2) reduce parameters and storage memory amount while ensuring high domain translation performance and good image quality as measured by FID and LPIPS scores. Full code is available at https://github.com/ispamm/HI2I.
引用
收藏
页数:8
相关论文
共 45 条
[1]  
Alaluf Yuval, 2022, Third time's the charm? image and video editing with stylegan3
[2]  
Almahairi A, 2018, PR MACH LEARN RES, V80
[3]  
[Anonymous], 2020, IEEE INT WORKS MACH
[4]  
Bachlechner T. C., 2021, C UNC ART INT UAI
[5]   Efficient Sound Event Localization and Detection in the Quaternion Domain [J].
Brignone, Christian ;
Mancini, Gioia ;
Grassucci, Eleonora ;
Uncini, Aurelio ;
Comminiello, Danilo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (05) :2453-2457
[6]  
Brock A., 2019, ICLR
[7]  
Chen HT, 2020, AAAI CONF ARTIF INTE, V34, P3585
[8]  
Choi Y., 2018, IEEE CVF C COMP VIS
[9]  
Chong Min Jin, 2021, ARXIV210606561
[10]  
Comminiello D, 2019, INT CONF ACOUST SPEE, P8533, DOI [10.1109/ICASSP.2019.8682711, 10.1109/icassp.2019.8682711]