Exploring the Temporal Consistency of Arbitrary Style Transfer: A Channelwise Perspective

被引:17
|
作者
Kong, Xiaoyu [1 ,2 ]
Deng, Yingying [3 ]
Tang, Fan [4 ]
Dong, Weiming [3 ]
Ma, Chongyang [5 ]
Chen, Yongyong
He, Zhenyu [6 ,7 ]
Xu, Changsheng [3 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518073, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[5] Kuaishou Technol, Beijing 100085, Peoples R China
[6] Harbin Inst Technol, Dept Comp Sci, Shenzhen 518073, Peoples R China
[7] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Correlation; Task analysis; Optical imaging; Integrated optics; Lighting; Optical fiber networks; Image reconstruction; Arbitrary stylization; channel correlation; cross-domain; feature migration;
D O I
10.1109/TNNLS.2022.3230084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Arbitrary image stylization by neural networks has become a popular topic, and video stylization is attracting more attention as an extension of image stylization. However, when image stylization methods are applied to videos, unsatisfactory results that suffer from severe flickering effects appear. In this article, we conducted a detailed and comprehensive analysis of the cause of such flickering effects. Systematic comparisons among typical neural style transfer approaches show that the feature migration modules for state-of-the-art (SOTA) learning systems are ill-conditioned and could lead to a channelwise misalignment between the input content representations and the generated frames. Unlike traditional methods that relieve the misalignment via additional optical flow constraints or regularization modules, we focus on keeping the temporal consistency by aligning each output frame with the input frame. To this end, we propose a simple yet efficient multichannel correlation network (MCCNet), to ensure that output frames are directly aligned with inputs in the hidden feature space while maintaining the desired style patterns. An inner channel similarity loss is adopted to eliminate side effects caused by the absence of nonlinear operations such as softmax for strict alignment. Furthermore, to improve the performance of MCCNet under complex light conditions, we introduce an illumination loss during training. Qualitative and quantitative evaluations demonstrate that MCCNet performs well in arbitrary video and image style transfer tasks.
引用
收藏
页码:8482 / 8496
页数:15
相关论文
共 50 条
  • [1] Arbitrary style transfer via content consistency and style consistency
    Xiaoming Yu
    Gan Zhou
    The Visual Computer, 2024, 40 : 1369 - 1382
  • [2] Arbitrary style transfer via content consistency and style consistency
    Yu, Xiaoming
    Zhou, Gan
    VISUAL COMPUTER, 2024, 40 (03): : 1369 - 1382
  • [3] Preserving Global and Local Temporal Consistency for Arbitrary Video Style Transfer
    Wu, Xinxiao
    Chen, Jialu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1791 - 1799
  • [4] Preserving Structural Consistency in Arbitrary Artist and Artwork Style Transfer
    Wu, Jingyu
    Hou, Lefan
    Li, Zejian
    Liao, Jun
    Liu, Li
    Sun, Lingyun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2830 - 2838
  • [5] Consistent Arbitrary Style Transfer Using Consistency Training and Self-Attention Module
    Zhou, Zheng
    Wu, Yue
    Zhou, Yicong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16845 - 16856
  • [6] Consistent Arbitrary Style Transfer Using Consistency Training and Self-Attention Module
    Zhou, Zheng
    Wu, Yue
    Zhou, Yicong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16845 - 16856
  • [7] Style Permutation for Diversified Arbitrary Style Transfer
    Li, Pan
    Zhang, Dan
    Zhao, Lei
    Xu, Duanqing
    Lu, Dongming
    IEEE ACCESS, 2020, 8 (08): : 199147 - 199158
  • [8] Arbitrary Style Transfer with Style Enhancement and Structure Retention
    Yang, Sijia
    Zhou, Yun
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 : 401 - 413
  • [9] Arbitrary Style Transfer with Style-Attentional Networks
    Park, Dae Young
    Lee, Kwang Hee
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5873 - 5881
  • [10] PHOTO STYLE TRANSFER WITH CONSISTENCY LOSSES
    Yao, Xu
    Puy, Gilles
    Perez, Patrick
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2314 - 2318