OVQE: Omniscient Network for Compressed Video Quality Enhancement

被引:11
作者
Peng, Liuhan [1 ]
Hamdulla, Askar [1 ]
Ye, Mao [2 ]
Li, Shuai [3 ]
Wang, Zengbin [2 ]
Li, Xue [4 ]
机构
[1] Xinjiang Univ, Inst Informat Sci & Engn, Urumqi 830049, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250000, Peoples R China
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Spatiotemporal phenomena; Video recording; Quality assessment; Frequency-domain analysis; Correlation; Task analysis; Iterative methods; Compressed video; video quality enhancement; omniscient network; deep learning;
D O I
10.1109/TBC.2022.3208426
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
How to use information from temporal, spatial, and frequency domain dimensions is crucial for the quality enhancement of compressed video. The state-of-the-art methods generally design powerful networks to fuse the spatiotemporal information of the videos. But the spatiotemporal information of the entire video is not fully utilized and effectively fused, resulting in the learned context information that is not closely related to the target frame. In addition, various compressed videos have varying degrees of frequency domain information loss. The previous methods ignored the non-uniform distortion of compressed video in different frequency domains and did not design unique algorithms for different frequency domains, so the real texture details of the video could not be restored. In this paper, we propose an omniscient network, which learns video spatiotemporal and omni-frequency information more effectively. The omniscient network consists of two novel components: a Spatio-Temporal Feature Fusion (STFF) module and an Omni-Frequency Adaptive Enhancement (OFAE) block. The former aims to capture spatiotemporal information in adjacent frames, while the latter aims to adaptively recover different frequency domains of compressed video. The information is designed to be bidirectionally propagated in a grid manner such that the omni-enhanced results can be applied. Extensive experiments show that our method outperforms the state-of-the-art method in terms of objective metrics, subjective visual effects, and model complexity.
引用
收藏
页码:153 / 164
页数:12
相关论文
共 50 条
[31]   Compressed Video Sensing Based on Deep Generative Adversarial Network [J].
Nezhad, Valiyeh Ansarian ;
Azghani, Masoumeh ;
Marvasti, Farokh .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (08) :5048-5064
[32]   A video compression-cum-classification network for classification from compressed video streams [J].
Yadav, Sangeeta ;
Gulia, Preeti ;
Gill, Nasib Singh ;
Yahya, Mohammad ;
Shukla, Piyush Kumar ;
Pareek, Piyush Kumar ;
Shukla, Prashant Kumar .
VISUAL COMPUTER, 2024, 40 (11) :7539-7558
[33]   GEVE: A generative adversarial network for extremely dark image/video enhancement [J].
Anitha, C. ;
Kumar, R. Mathusoothana S. .
PATTERN RECOGNITION LETTERS, 2022, 155 :159-164
[34]   Artifacts Reduction GAN For Fnhancing Quality Of Compressed Panoramic Video [J].
Wang, Xueshu ;
Jing, Xiaojun ;
Huang, Hai ;
Cui, Yuanhao ;
Kadoch, Michel ;
Cheriet, Mohamed .
2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
[35]   Quality Assessment of Compressed Video for Automatic License Plate Recognition [J].
Ukhanova, Anna ;
Stottrup-Andersen, Jesper ;
Forchhammer, Soren ;
Madsen, John .
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, :306-313
[36]   Spatiotemporal Representation Learning for Blind Video Quality Assessment [J].
Liu, Yongxu ;
Wu, Jinjian ;
Li, Leida ;
Dong, Weisheng ;
Zhang, Jinpeng ;
Shi, Guangming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) :3500-3513
[37]   Fast Video Quality Enhancement using GANs [J].
Galteri, Leonardo ;
Seidenari, Lorenzo ;
Bertini, Marco ;
Uricchio, Tiberio ;
Del Bimbo, Alberto .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1065-1067
[38]   Enhanced Video Super-Resolution Network towards Compressed Data [J].
Li, Feng ;
Wu, Yixuan ;
Li, Anqi ;
Bai, Huihui ;
Cong, Runmin ;
Zhao, Yao .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
[39]   Exploring Spatial Frequency Information for Enhanced Video Prediction Quality [J].
Lai, Junyu ;
Gan, Lianqiang ;
Zhu, Junhong ;
Liu, Huashuo ;
Gao, Lianli .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :8955-8968
[40]   The Compressed Average Image Intensity metric for stereoscopic video quality assessment [J].
Wilczewski, Grzegorz .
PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2016, 2016, 10031