Dual-Affinity Style Embedding Network for Semantic-Aligned Image Style Transfer

被引:14
|
作者
Ma, Zhuoqi [1 ]
Lin, Tianwei [2 ]
Li, Xin [2 ]
Li, Fu [2 ]
He, Dongliang [2 ]
Ding, Errui [2 ]
Wang, Nannan [3 ]
Gao, Xinbo [4 ,5 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Peoples R China
[2] Baidu Inc, Dept Comp Vis Vis Technol, Beijing 100080, Peoples R China
[3] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[5] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Optimization; Feature extraction; Visualization; Correlation; Training; Real-time systems; Dual-affinity; semantic style transfer; style embedding;
D O I
10.1109/TNNLS.2022.3143356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image style transfer aims at synthesizing an image with the content from one image and the style from another. User studies have revealed that the semantic correspondence between style and content greatly affects subjective perception of style transfer results. While current studies have made great progress in improving the visual quality of stylized images, most methods directly transfer global style statistics without considering semantic alignment. Current semantic style transfer approaches still work in an iterative optimization fashion, which is impractically computationally expensive. Addressing these issues, we introduce a novel dual-affinity style embedding network (DaseNet) to synthesize images with style aligned at semantic region granularity. In the dual-affinity module, feature correlation and semantic correspondence between content and style images are modeled jointly for embedding local style patterns according to semantic distribution. Furthermore, the semantic-weighted style loss and the region-consistency loss are introduced to ensure semantic alignment and content preservation. With the end-to-end network architecture, DaseNet can well balance visual quality and inference efficiency for semantic style transfer. Experimental results on different scene categories have demonstrated the effectiveness of the proposed method.
引用
收藏
页码:7404 / 7417
页数:14
相关论文
共 45 条
  • [1] Semantic-Aligned Attention With Refining Feature Embedding for Few-Shot Image Classification
    Xu, Xianda
    Xu, Xing
    Shen, Fumin
    Li, Yujie
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25458 - 25468
  • [2] Image Style Transfer Algorithm Based on Semantic Segmentation
    Lin, Zhijie
    Wang, Zhizhong
    Chen, Haibo
    Ma, Xiaolong
    Xie, Chuan
    Xing, Wei
    Zhao, Lei
    Song, Wei
    IEEE ACCESS, 2021, 9 : 54518 - 54529
  • [3] Semantic Context-Aware Image Style Transfer
    Liao, Yi-Sheng
    Huang, Chun-Rong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1911 - 1923
  • [4] Neural Style Palette: A Multimodal and Interactive Style Transfer From a Single Style Image
    Virtusio, John Jethro
    Ople, Jose Jaena Mari
    Tan, Daniel Stanley
    Tanveer, M.
    Kumar, Neeraj
    Hua, Kai-Lung
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2245 - 2258
  • [5] Unbiased Image Style Transfer
    Choi, Hyun-Chul
    IEEE ACCESS, 2020, 8 : 196600 - 196608
  • [6] TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning
    Wang, Lanxiao
    Qiu, Heqian
    Qiu, Benliu
    Meng, Fanman
    Wu, Qingbo
    Li, Hongliang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3563 - 3575
  • [7] SSTRN: Semantic Style Transfer Reference Network for Face Super-Resolution
    Farhangfar, Saghar
    Baradarani, Aryaz
    Balafar, Mohammad Ali
    Asadpour, Mohammad
    2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,
  • [8] Content-Biased and Style-Assisted Transfer Network for Cross-Scene Hyperspectral Image Classification
    Shi, Zuowei
    Lai, Xudong
    Deng, Juan
    Liu, Jinshuo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [9] Structure-Guided Arbitrary Style Transfer for Artistic Image and Video
    Liu, Shiguang
    Zhu, Ting
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1299 - 1312
  • [10] Tear the Image Into Strips for Style Transfer
    Huang, Yujie
    Liu, Yuhao
    Jing, Minge
    Zeng, Xiaoyang
    Fan, Yibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3978 - 3988