CodingHomo: Bootstrapping Deep Homography With Video Coding

被引:0
|
作者
Liu, Yike [1 ]
Li, Haipeng [1 ]
Liu, Shuaicheng [1 ]
Zeng, Bing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;
D O I
10.1109/TCSVT.2024.3418771
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.
引用
收藏
页码:11214 / 11228
页数:15
相关论文
共 50 条
  • [1] Homography-based block motion estimation for video coding of PTZ cameras
    Guo, Xiaoming
    Jiang, Guang
    Cui, Zhaopeng
    Tao, Pei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 39 : 164 - 171
  • [2] Unsupervised Deep Homography: A Fast and Robust Homography Estimation Model
    Ty Nguyen
    Chen, Steven W.
    Shivakumar, Shreyas S.
    Taylor, Camillo Jose
    Kumar, Vijay
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03): : 2346 - 2353
  • [3] A NOVEL HOMOGRAPHY-BASED SEARCH ALGORITHM FOR BLOCK MOTION ESTIMATION IN VIDEO CODING
    Cui, Zhaopeng
    Jiang, Guang
    Wang, Dujuan
    Wu, Chengke
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [4] Motion-Aware Deep Video Coding Network
    Khan, Rida
    Liu, Ying
    BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [5] A LIGHTWEIGHT MODEL FOR DEEP FRAME PREDICTION IN VIDEO CODING
    Choi, Hyomin
    Bajic, Ivan, V
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1122 - 1126
  • [6] GOP-based Deep Preprocessing for Video Coding
    Arai, Daichi
    Iwamura, Shunsuke
    Iguchi, Kazuhisa
    Ichigaya, Atsuro
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [7] Deep learning-based video quality enhancement for the new versatile video coding
    Bouaafia, Soulef
    Khemiri, Randa
    Messaoud, Seifeddine
    Ben Ahmed, Olfa
    Sayadi, Fatma Ezahra
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) : 14135 - 14149
  • [8] Deep learning-based video quality enhancement for the new versatile video coding
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Olfa Ben Ahmed
    Fatma Ezahra Sayadi
    Neural Computing and Applications, 2022, 34 : 14135 - 14149
  • [9] CodingFlow: Enable Video Coding for Video Stabilization
    Liu, Shuaicheng
    Li, Mingyu
    Zhu, Shuyuan
    Zeng, Bing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3291 - 3302
  • [10] DEEP VIRTUAL REFERENCE FRAME GENERATION FOR MULTIVIEW VIDEO CODING
    Lei, Jianjun
    Zhang, Zongqian
    Liu, Dong
    Chen, Ying
    Ling, Nam
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1123 - 1127