COLLABORATIVE SPATIAL-TEMPORAL DISTILLATION FOR EFFICIENT VIDEO DERAINING

被引：0

作者：

Hu, Yuzhang ^{[1
]}

Liu, Minghao ^{[1
]}

Yang, Wenhan ^{[2
]}

Liu, Jiaying ^{[1
]}

Guo, Zongming ^{[1
]}

机构：

[1] Peking Univ, Beijing, Peoples R China

[2] Peng Cheng Lab, Shenzhen, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023年

基金：

中国国家自然科学基金;

关键词：

Video Deraining; Knowledge Distillation; Spatial Alignment; Temporal Alignment; Spatial-Temporal Adaptor; RAIN;

D O I：

10.1109/ICME55011.2023.00332

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a novel knowledge distillation framework to improve the efficiency of deep networks for video deraining. The knowledge is transferred from a large-scale powerful teacher network to a compact efficient student network via the proposed collaborative spatial-temporal distillation framework. The framework is equipped with three collaboration schemes of different granularities that make use of spatial-temporal redundancy in a complementary way for better distillation performance. First, the spatial alignment module applies distillation constraints at different spatial scales to achieve better scale invariance in transferred knowledge. Second, the temporal alignment module traces both temporal status between teacher and student separately and collaboratively, to comprehensively utilize inter-frame information. Third, these two alignment modules interact through a spatial-temporal adaptor, where spatial-temporal knowledge is transferred in a unified framework. Extensive experiments demonstrate the superiority of our distillation framework as well as the effectiveness of each module. Our code is available at: https://github.com/HuYuzhang/Knowledge-Distillation.

引用

页码：1937 / 1942

页数：6

共 50 条

[1] Latency-Constrained Spatial-Temporal Aggregated Architecture Search for Video Deraining
Liu, Zhu
Ma, Long
Liu, Risheng
Fan, Xin
Luo, Zhongxuan
Zhang, Yuduo
PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 16 - 28
[2] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
Hu, Mengshun
Jiang, Kui
Liao, Liang
Nie, Zhixiang
Xiao, Jing
Wang, Zheng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2145 - 2153
[3] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
Hu, Mengshun
Jiang, Kui
Liao, Liang
Nie, Zhixiang
Xiao, Jing
Wang, Zheng
MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, 2022, : 2145 - 2153
[4] Efficient Video Transformers with Spatial-Temporal Token Selection
Wang, Junke
Yang, Xitong
Li, Hengduo
Liu, Li
Wu, Zuxuan
Jiang, Yu-Gang
COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 69 - 86
[5] CTVSR: Collaborative Spatial-Temporal Transformer for Video Super-Resolution
Tang, Jun
Lu, Chenyan
Liu, Zhengxue
Li, Jiale
Dai, Hang
Ding, Yong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5018 - 5032
[6] LightViD: Efficient Video Deblurring With Spatial-Temporal Feature Fusion
Lin, Liqun
Wei, Guangpeng
Liu, Kanglin
Feng, Wanjian
Zhao, Tiesong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7430 - 7439
[7] An Efficient Spatial-Temporal Polyp Detection Framework for Colonoscopy Video
Zhang, Pengfei
Sun, Xinzi
Wang, Dechun
Wang, Xizhe
Cao, Yu
Liu, Benyuan
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1252 - 1259
[8] Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Hui, Tianrui
Huang, Shaofei
Liu, Si
Ding, Zihan
Li, Guanbin
Wang, Wenguan
Han, Jizhong
Wang, Fei
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4185 - 4194
[9] Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Hui, Tianrui
Huang, Shaofei
Liu, Si
Ding, Zihan
Li, Guanbin
Wang, Wenguan
Han, Jizhong
Wang, Fei
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2021, : 4185 - 4194
[10] Collaborative spatial-temporal video salient object detection with cross attention transformer
Su, Yuting
Wang, Weikang
Liu, Jing
Jing, Peiguang
SIGNAL PROCESSING, 2024, 224

← 1 2 3 4 5 →