OVQE: Omniscient Network for Compressed Video Quality Enhancement

被引:14
作者
Peng, Liuhan [1 ]
Hamdulla, Askar [1 ]
Ye, Mao [2 ]
Li, Shuai [3 ]
Wang, Zengbin [2 ]
Li, Xue [4 ]
机构
[1] Xinjiang Univ, Inst Informat Sci & Engn, Urumqi 830049, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250000, Peoples R China
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Spatiotemporal phenomena; Video recording; Quality assessment; Frequency-domain analysis; Correlation; Task analysis; Iterative methods; Compressed video; video quality enhancement; omniscient network; deep learning;
D O I
10.1109/TBC.2022.3208426
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
How to use information from temporal, spatial, and frequency domain dimensions is crucial for the quality enhancement of compressed video. The state-of-the-art methods generally design powerful networks to fuse the spatiotemporal information of the videos. But the spatiotemporal information of the entire video is not fully utilized and effectively fused, resulting in the learned context information that is not closely related to the target frame. In addition, various compressed videos have varying degrees of frequency domain information loss. The previous methods ignored the non-uniform distortion of compressed video in different frequency domains and did not design unique algorithms for different frequency domains, so the real texture details of the video could not be restored. In this paper, we propose an omniscient network, which learns video spatiotemporal and omni-frequency information more effectively. The omniscient network consists of two novel components: a Spatio-Temporal Feature Fusion (STFF) module and an Omni-Frequency Adaptive Enhancement (OFAE) block. The former aims to capture spatiotemporal information in adjacent frames, while the latter aims to adaptively recover different frequency domains of compressed video. The information is designed to be bidirectionally propagated in a grid manner such that the omni-enhanced results can be applied. Extensive experiments show that our method outperforms the state-of-the-art method in terms of objective metrics, subjective visual effects, and model complexity.
引用
收藏
页码:153 / 164
页数:12
相关论文
共 50 条
[41]   Spatiotemporal Representation Learning for Blind Video Quality Assessment [J].
Liu, Yongxu ;
Wu, Jinjian ;
Li, Leida ;
Dong, Weisheng ;
Zhang, Jinpeng ;
Shi, Guangming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) :3500-3513
[42]   Fast Video Quality Enhancement using GANs [J].
Galteri, Leonardo ;
Seidenari, Lorenzo ;
Bertini, Marco ;
Uricchio, Tiberio ;
Del Bimbo, Alberto .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1065-1067
[43]   Exploring Spatial Frequency Information for Enhanced Video Prediction Quality [J].
Lai, Junyu ;
Gan, Lianqiang ;
Zhu, Junhong ;
Liu, Huashuo ;
Gao, Lianli .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :8955-8968
[44]   Enhanced Video Super-Resolution Network towards Compressed Data [J].
Li, Feng ;
Wu, Yixuan ;
Li, Anqi ;
Bai, Huihui ;
Cong, Runmin ;
Zhao, Yao .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
[45]   The Compressed Average Image Intensity metric for stereoscopic video quality assessment [J].
Wilczewski, Grzegorz .
PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2016, 2016, 10031
[46]   Multiview video quality enhancement without depth information [J].
Jammal, Samer ;
Tillo, Tammam ;
Xiao, Jimin .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 75 :22-31
[47]   QEVC: Quality Enhancement-Oriented Video Coding [J].
Li, Hao ;
Lei, Weimin ;
Zhang, Wei .
2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, :296-300
[48]   HEVC Video Quality Enhancement Using Deep Learning with Super Interpolation and Laplacian Filter [J].
Sheeba, G. ;
Maheswari, M. .
IETE JOURNAL OF RESEARCH, 2023, 69 (11) :7979-7992
[49]   LAE-Net: Light and Efficient Network for Compressed Video Action Recognition [J].
Guo, Jinxin ;
Zhang, Jiaqiang ;
Zhang, Xiaojing ;
Ma, Ming .
MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 :265-276
[50]   Video Quality Assessment of Danmaku-Based Video Saliency Regions [J].
Cao, Lina ;
Guo, Dongliang ;
Wang, Quan ;
Feng, Li ;
Shi, Chuanbao .
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :2213-2217