Restoration of User Videos Shared on Social Media

被引:2
作者
Luo, Hongming [1 ]
Zhou, Fei [1 ,3 ,4 ]
Lam, Kin-Man [2 ]
Qiu, Guoping [1 ,5 ,6 ,7 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[2] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[5] Shenzhen Key Lab Digital Creat Technol, Shenzhen, Peoples R China
[6] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[7] Guangdong Hong Kong Joint Lab Big Data Imaging &, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
关键词
social media; video restoration; degradation sensing;
D O I
10.1145/3503161.3547847
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
User videos shared on social media platforms usually suffer from degradations caused by unknown proprietary processing procedures, which means that their visual quality is poorer than that of the originals. This paper presents a new general video restoration framework for the restoration of user videos shared on social media platforms. In contrast to most deep learning-based video restoration methods that perform end-to-end mapping, where feature extraction is mostly treated as a black box, in the sense that what role a feature plays is often unknown, our new method, termed Video restOration through adapTive dEgradation Sensing (VOTES), introduces the concept of a degradation feature map (DFM) to explicitly guide the video restoration process. Specifically, for each video frame, we first adaptively estimate its DFM to extract features representing the difficulty of restoring its different regions. We then feed the DFM to a convolutional neural network (CNN) to compute hierarchical degradation features to modulate an endto-end video restoration backbone network, such that more attention is paid explicitly to potentially more difficult to restore areas, which in turn leads to enhanced restoration performance. We will explain the design rationale of the VOTES framework and present extensive experimental results to show that the new VOTES method outperforms various state-of-the-art techniques both quantitatively and qualitatively. In addition, we contribute a large scale real-world database of user videos shared on different social media platforms. Codes and datasets are available at https://github.com/luohongming/VOTES.git
引用
收藏
页码:2749 / 2757
页数:9
相关论文
共 32 条
[1]   BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond [J].
Chan, Kelvin C. K. ;
Wang, Xintao ;
Yu, Ke ;
Dong, Chao ;
Loy, Chen Change .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4945-4954
[2]   CISRDCNN: Super-resolution of compressed images using deep convolutional neural networks [J].
Chen, Honggang ;
He, Xiaohai ;
Ren, Chao ;
Qing, Linbo ;
Teng, Qizhi .
NEUROCOMPUTING, 2018, 285 :204-219
[3]   Second-order Attention Network for Single Image Super-Resolution [J].
Dai, Tao ;
Cai, Jianrui ;
Zhang, Yongbing ;
Xia, Shu-Tao ;
Zhang, Lei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11057-11066
[4]  
Deng JN, 2020, AAAI CONF ARTIF INTE, V34, P10696
[5]   JPEG Artifacts Reduction via Deep Convolutional Sparse Coding [J].
Fu, Xueyang ;
Zha, Zheng-Jun ;
Wu, Feng ;
Ding, Xinghao ;
Paisley, John .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2501-2510
[6]   Blind Super-Resolution With Iterative Kernel Correction [J].
Gu, Jinjin ;
Lu, Hannan ;
Zuo, Wangmeng ;
Dong, Chao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1604-1613
[7]   MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video [J].
Guan, Zhenyu ;
Xing, Qunliang ;
Xu, Mai ;
Yang, Ren ;
Liu, Tie ;
Wang, Zulin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) :949-963
[8]   Video Super-Resolution with Recurrent Structure-Detail Network [J].
Isobe, Takashi ;
Jia, Xu ;
Gu, Shuhang ;
Li, Songjiang ;
Wang, Shengjin ;
Tian, Qi .
COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :645-660
[9]   Video Super-resolution with Temporal Group Attention [J].
Isobe, Takashi ;
Li, Songjiang ;
Jia, Xu ;
Yuan, Shanxin ;
Slabaugh, Gregory ;
Xu, Chunjing ;
Li, Ya-Li ;
Wang, Shengjin ;
Tian, Qi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8005-8014
[10]  
Kim J, 2016, PROC CVPR IEEE, P1637, DOI [10.1109/CVPR.2016.182, 10.1109/CVPR.2016.181]