CodingHomo: Bootstrapping Deep Homography With Video Coding

被引:0
作者
Liu, Yike [1 ]
Li, Haipeng [1 ]
Liu, Shuaicheng [1 ]
Zeng, Bing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;
D O I
10.1109/TCSVT.2024.3418771
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.
引用
收藏
页码:11214 / 11228
页数:15
相关论文
共 50 条
  • [21] Deep Learning-Based Luma and Chroma Fractional Interpolation in Video Coding
    Pham, Chi Do-Kim
    Zhou, Jinjia
    IEEE ACCESS, 2019, 7 : 112535 - 112543
  • [22] Deep Multi-Domain Prediction for 3D Video Coding
    Lei, Jianjun
    Shi, Yanan
    Pan, Zhaoqing
    Liu, Dong
    Jin, Dengchao
    Chen, Ying
    Ling, Nam
    IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (04) : 813 - 823
  • [23] Deep Integer-Position Samples Refinement for Motion Compensation of Video Coding
    Xia, Sifeng
    Hu, Yueyu
    Liu, Jiaying
    DIGITAL TV AND MULTIMEDIA COMMUNICATION, 2019, 1009 : 391 - 400
  • [24] On Predicting Bottlenecks in Wavefront Parallel Video Coding Using Deep Neural Networks
    Panagou, Natalia
    Oikonomou, Panagiotis
    Papadopoulos, Panos K.
    Koziri, Maria
    Loukopoulos, Thanasis
    Iakovidis, Dimitris
    ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 : 501 - 510
  • [25] Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding
    Li, Ge
    Lei, Jianjun
    Pan, Zhaoqing
    Peng, Bo
    Ling, Nam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6337 - 6346
  • [26] Wyner-Ziv Video Coding using Hadamard Transform and Deep Learning
    Kouma, Jean-Paul
    Soderstrom, Ulrik
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 582 - 589
  • [27] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Wang, Shiqi
    Kwong, Sam
    Jin, Xin
    Qiao, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
  • [28] Static Video Summarization Using Video Coding Features with Frame-Level Temporal Subsampling and Deep Learning
    Issa, Obada
    Shanableh, Tamer
    APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [29] RETRACTED ARTICLE: An Improvised video coding algorithm for deep learning-based video transmission using HEVC
    A. Srinivasan
    G. Rohini
    Soft Computing, 2019, 23 : 8503 - 8514
  • [30] Predictive Patch Matching Method For Inter Frame Coding In Advanced Video Coding
    Talawar, Tejashwini
    Naik, N. Manja
    Parameshachari, B. D.
    Banu, Reshma
    Rajashekarappa
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 918 - 922