CodingHomo: Bootstrapping Deep Homography With Video Coding

被引：0

作者：

Liu, Yike ^{[1
]}

Li, Haipeng ^{[1
]}

Liu, Shuaicheng ^{[1
]}

Zeng, Bing ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;

D O I：

10.1109/TCSVT.2024.3418771

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.

引用

页码：11214 / 11228

页数：15

共 50 条

[1] Homography-based block motion estimation for video coding of PTZ cameras
Guo, Xiaoming
Jiang, Guang
Cui, Zhaopeng
Tao, Pei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 39 : 164 - 171
[2] Unsupervised Deep Homography: A Fast and Robust Homography Estimation Model
Ty Nguyen
Chen, Steven W.
Shivakumar, Shreyas S.
Taylor, Camillo Jose
Kumar, Vijay
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03): : 2346 - 2353
[3] A NOVEL HOMOGRAPHY-BASED SEARCH ALGORITHM FOR BLOCK MOTION ESTIMATION IN VIDEO CODING
Cui, Zhaopeng
Jiang, Guang
Wang, Dujuan
Wu, Chengke
2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
[4] Motion-Aware Deep Video Coding Network
Khan, Rida
Liu, Ying
BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
[5] A LIGHTWEIGHT MODEL FOR DEEP FRAME PREDICTION IN VIDEO CODING
Choi, Hyomin
Bajic, Ivan, V
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1122 - 1126
[6] GOP-based Deep Preprocessing for Video Coding
Arai, Daichi
Iwamura, Shunsuke
Iguchi, Kazuhisa
Ichigaya, Atsuro
2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
[7] Deep learning-based video quality enhancement for the new versatile video coding
Bouaafia, Soulef
Khemiri, Randa
Messaoud, Seifeddine
Ben Ahmed, Olfa
Sayadi, Fatma Ezahra
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) : 14135 - 14149
[8] Deep learning-based video quality enhancement for the new versatile video coding
Soulef Bouaafia
Randa Khemiri
Seifeddine Messaoud
Olfa Ben Ahmed
Fatma Ezahra Sayadi
Neural Computing and Applications, 2022, 34 : 14135 - 14149
[9] CodingFlow: Enable Video Coding for Video Stabilization
Liu, Shuaicheng
Li, Mingyu
Zhu, Shuyuan
Zeng, Bing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3291 - 3302
[10] DEEP VIRTUAL REFERENCE FRAME GENERATION FOR MULTIVIEW VIDEO CODING
Lei, Jianjun
Zhang, Zongqian
Liu, Dong
Chen, Ying
Ling, Nam
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1123 - 1127

← 1 2 3 4 5 →