Dual-Affinity Style Embedding Network for Semantic-Aligned Image Style Transfer

被引:14
|
作者
Ma, Zhuoqi [1 ]
Lin, Tianwei [2 ]
Li, Xin [2 ]
Li, Fu [2 ]
He, Dongliang [2 ]
Ding, Errui [2 ]
Wang, Nannan [3 ]
Gao, Xinbo [4 ,5 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Peoples R China
[2] Baidu Inc, Dept Comp Vis Vis Technol, Beijing 100080, Peoples R China
[3] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[5] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Optimization; Feature extraction; Visualization; Correlation; Training; Real-time systems; Dual-affinity; semantic style transfer; style embedding;
D O I
10.1109/TNNLS.2022.3143356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image style transfer aims at synthesizing an image with the content from one image and the style from another. User studies have revealed that the semantic correspondence between style and content greatly affects subjective perception of style transfer results. While current studies have made great progress in improving the visual quality of stylized images, most methods directly transfer global style statistics without considering semantic alignment. Current semantic style transfer approaches still work in an iterative optimization fashion, which is impractically computationally expensive. Addressing these issues, we introduce a novel dual-affinity style embedding network (DaseNet) to synthesize images with style aligned at semantic region granularity. In the dual-affinity module, feature correlation and semantic correspondence between content and style images are modeled jointly for embedding local style patterns according to semantic distribution. Furthermore, the semantic-weighted style loss and the region-consistency loss are introduced to ensure semantic alignment and content preservation. With the end-to-end network architecture, DaseNet can well balance visual quality and inference efficiency for semantic style transfer. Experimental results on different scene categories have demonstrated the effectiveness of the proposed method.
引用
收藏
页码:7404 / 7417
页数:14
相关论文
共 45 条
  • [21] Artifact-Free Image Style Transfer by Using Feature Map Clamping
    Lee, Hyeonjin
    Choi, Hyun-Chul
    IEEE ACCESS, 2020, 8 : 89489 - 89496
  • [22] Dual-Level Semantic Transfer Deep Hashing for Efficient Social Image Retrieval
    Zhu, Lei
    Cui, Hui
    Cheng, Zhiyong
    Li, Jingjing
    Zhang, Zheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1478 - 1489
  • [23] Global-Local Coupled Style Transfer for Semantic Segmentation of Bitemporal Remote Sensing Images
    Wang, Hao
    Guo, Mingning
    Li, Shaoxian
    Li, Haifeng
    Tao, Chao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [24] Knowledge-Guided Semantic Transfer Network for Few-Shot Image Recognition
    Li, Zechao
    Tang, Hao
    Peng, Zhimao
    Qi, Guo-Jun
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [25] Image Style Transfer-Based Data Augmentation for Sanitary Ceramic Defect Detection
    Hang, Jingfan
    Yang, Xianqiang
    Ye, Chao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [26] Side-Scan Sonar Image Classification With Zero-Shot and Style Transfer
    Bai, Zhongyu
    Xu, Hongli
    Ding, Qichuan
    Zhang, Xiangyue
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
  • [27] Mining Semantic Information With Dual Relation Graph Network for Multi-Label Image Classification
    Zhou, Wei
    Jiang, Weitao
    Chen, Dihu
    Hu, Haifeng
    Su, Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1143 - 1157
  • [28] An Edge Filter Based Approach of Neural Style Transfer to the Image Stylization
    Bagwari, Shubham
    Choudhary, Kanika
    Raikwar, Suresh
    Nijhawan, Rahul
    Kumar, Sunil
    Shah, Mohd Asif
    IEEE ACCESS, 2022, 10 : 104612 - 104621
  • [29] Topological Structure and Semantic Information Transfer Network for Cross-Scene Hyperspectral Image Classification
    Zhang, Yuxiang
    Li, Wei
    Zhang, Mengmeng
    Qu, Ying
    Tao, Ran
    Qi, Hairong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (06) : 2817 - 2830
  • [30] CycleGAN for Flash-to-Ambient Image Conversion: A Style Transfer Approach
    Ramanathan, Siddharth
    Mukherjee, Archit
    Moulisvaran, U.
    Pramoth, K. P.
    Kothandaraman, Mohanaprasad
    Monica, P. Reena
    Kumar, L. Ramesh
    Gangele, Sunil
    IEEE ACCESS, 2024, 12 : 181987 - 181996