COLLABORATIVE SPATIAL-TEMPORAL DISTILLATION FOR EFFICIENT VIDEO DERAINING

被引:0
|
作者
Hu, Yuzhang [1 ]
Liu, Minghao [1 ]
Yang, Wenhan [2 ]
Liu, Jiaying [1 ]
Guo, Zongming [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Video Deraining; Knowledge Distillation; Spatial Alignment; Temporal Alignment; Spatial-Temporal Adaptor; RAIN;
D O I
10.1109/ICME55011.2023.00332
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel knowledge distillation framework to improve the efficiency of deep networks for video deraining. The knowledge is transferred from a large-scale powerful teacher network to a compact efficient student network via the proposed collaborative spatial-temporal distillation framework. The framework is equipped with three collaboration schemes of different granularities that make use of spatial-temporal redundancy in a complementary way for better distillation performance. First, the spatial alignment module applies distillation constraints at different spatial scales to achieve better scale invariance in transferred knowledge. Second, the temporal alignment module traces both temporal status between teacher and student separately and collaboratively, to comprehensively utilize inter-frame information. Third, these two alignment modules interact through a spatial-temporal adaptor, where spatial-temporal knowledge is transferred in a unified framework. Extensive experiments demonstrate the superiority of our distillation framework as well as the effectiveness of each module. Our code is available at: https://github.com/HuYuzhang/Knowledge-Distillation.
引用
收藏
页码:1937 / 1942
页数:6
相关论文
共 50 条
  • [1] Latency-Constrained Spatial-Temporal Aggregated Architecture Search for Video Deraining
    Liu, Zhu
    Ma, Long
    Liu, Risheng
    Fan, Xin
    Luo, Zhongxuan
    Zhang, Yuduo
    PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 16 - 28
  • [2] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
    Hu, Mengshun
    Jiang, Kui
    Liao, Liang
    Nie, Zhixiang
    Xiao, Jing
    Wang, Zheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2145 - 2153
  • [3] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
    Hu, Mengshun
    Jiang, Kui
    Liao, Liang
    Nie, Zhixiang
    Xiao, Jing
    Wang, Zheng
    MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, 2022, : 2145 - 2153
  • [4] Efficient Video Transformers with Spatial-Temporal Token Selection
    Wang, Junke
    Yang, Xitong
    Li, Hengduo
    Liu, Li
    Wu, Zuxuan
    Jiang, Yu-Gang
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 69 - 86
  • [5] CTVSR: Collaborative Spatial-Temporal Transformer for Video Super-Resolution
    Tang, Jun
    Lu, Chenyan
    Liu, Zhengxue
    Li, Jiale
    Dai, Hang
    Ding, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5018 - 5032
  • [6] LightViD: Efficient Video Deblurring With Spatial-Temporal Feature Fusion
    Lin, Liqun
    Wei, Guangpeng
    Liu, Kanglin
    Feng, Wanjian
    Zhao, Tiesong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7430 - 7439
  • [7] An Efficient Spatial-Temporal Polyp Detection Framework for Colonoscopy Video
    Zhang, Pengfei
    Sun, Xinzi
    Wang, Dechun
    Wang, Xizhe
    Cao, Yu
    Liu, Benyuan
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1252 - 1259
  • [8] Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
    Hui, Tianrui
    Huang, Shaofei
    Liu, Si
    Ding, Zihan
    Li, Guanbin
    Wang, Wenguan
    Han, Jizhong
    Wang, Fei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4185 - 4194
  • [9] Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
    Hui, Tianrui
    Huang, Shaofei
    Liu, Si
    Ding, Zihan
    Li, Guanbin
    Wang, Wenguan
    Han, Jizhong
    Wang, Fei
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2021, : 4185 - 4194
  • [10] Collaborative spatial-temporal video salient object detection with cross attention transformer
    Su, Yuting
    Wang, Weikang
    Liu, Jing
    Jing, Peiguang
    SIGNAL PROCESSING, 2024, 224