CodingHomo: Bootstrapping Deep Homography With Video Coding

被引：0

作者：

Liu, Yike ^{[1
]}

Li, Haipeng ^{[1
]}

Liu, Shuaicheng ^{[1
]}

Zeng, Bing ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;

D O I：

10.1109/TCSVT.2024.3418771

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.

引用

页码：11214 / 11228

页数：15

共 50 条

[21] Deep Learning-Based Luma and Chroma Fractional Interpolation in Video Coding
Pham, Chi Do-Kim
Zhou, Jinjia
IEEE ACCESS, 2019, 7 : 112535 - 112543
[22] Deep Multi-Domain Prediction for 3D Video Coding
Lei, Jianjun
Shi, Yanan
Pan, Zhaoqing
Liu, Dong
Jin, Dengchao
Chen, Ying
Ling, Nam
IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (04) : 813 - 823
[23] Deep Integer-Position Samples Refinement for Motion Compensation of Video Coding
Xia, Sifeng
Hu, Yueyu
Liu, Jiaying
DIGITAL TV AND MULTIMEDIA COMMUNICATION, 2019, 1009 : 391 - 400
[24] On Predicting Bottlenecks in Wavefront Parallel Video Coding Using Deep Neural Networks
Panagou, Natalia
Oikonomou, Panagiotis
Papadopoulos, Panos K.
Koziri, Maria
Loukopoulos, Thanasis
Iakovidis, Dimitris
ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 : 501 - 510
[25] Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding
Li, Ge
Lei, Jianjun
Pan, Zhaoqing
Peng, Bo
Ling, Nam
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6337 - 6346
[26] Wyner-Ziv Video Coding using Hadamard Transform and Deep Learning
Kouma, Jean-Paul
Soderstrom, Ulrik
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 582 - 589
[27] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
Zhu, Linwei
Zhang, Yun
Wang, Shiqi
Kwong, Sam
Jin, Xin
Qiao, Yu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
[28] Static Video Summarization Using Video Coding Features with Frame-Level Temporal Subsampling and Deep Learning
Issa, Obada
Shanableh, Tamer
APPLIED SCIENCES-BASEL, 2023, 13 (10):
[29] RETRACTED ARTICLE: An Improvised video coding algorithm for deep learning-based video transmission using HEVC
A. Srinivasan
G. Rohini
Soft Computing, 2019, 23 : 8503 - 8514
[30] Predictive Patch Matching Method For Inter Frame Coding In Advanced Video Coding
Talawar, Tejashwini
Naik, N. Manja
Parameshachari, B. D.
Banu, Reshma
Rajashekarappa
2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 918 - 922

← 1 2 3 4 5 →