CodingHomo: Bootstrapping Deep Homography With Video Coding

被引:0
作者
Liu, Yike [1 ]
Li, Haipeng [1 ]
Liu, Shuaicheng [1 ]
Zeng, Bing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;
D O I
10.1109/TCSVT.2024.3418771
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.
引用
收藏
页码:11214 / 11228
页数:15
相关论文
共 50 条
  • [31] Role of Nanotechnology in Lossless Video Compression Coding
    Loukil, H.
    Abbas, M.
    Algahtani, Ali
    Kessentini, A.
    Muneer, P.
    Ijyas, Thafasal
    Wase, M. Abdul
    NANOSCIENCE AND NANOTECHNOLOGY LETTERS, 2019, 11 (12) : 1617 - 1632
  • [32] Efficient Motion Estimation Algorithms for Video Coding
    Wu, Ming-Te
    ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2): : 953 - 956
  • [33] TENSOR VIDEO CODING
    Mahfoodh, Abo Talib
    Radha, Hayder
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 1724 - 1728
  • [34] Video Coding for Machine
    Gao, Wen
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1 - 1
  • [35] The Future of Video Coding
    Ling, Nam
    Kuo, C. -C. Jay
    Sullivan, Gary J.
    Xu, Dong
    Liu, Shan
    Hang, Hsueh-Ming
    Peng, Wen-Hsiao
    Liu, Jiaying
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
  • [36] Distributed video coding
    Girod, B
    Margot, A
    Rane, S
    Rebollo-Monedero, D
    PROCEEDINGS OF THE IEEE, 2005, 93 (01) : 71 - 83
  • [37] Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation
    Zhao, Lei
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4832 - 4844
  • [38] Deep Video Prediction Network-ased Inter-Frame Coding in HEVC
    Lee, Jung-Kyung
    Kim, Nayoung
    Cho, Seunghyun
    Kang, Je-Won
    IEEE ACCESS, 2020, 8 : 95906 - 95917
  • [39] Low-Complexity Error Resilient HEVC Video Coding: A Deep Learning Approach
    Wang, Taiyu
    Li, Fan
    Qiao, Xiaoya
    Cosman, Pamela C.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1245 - 1260
  • [40] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
    Jia, Jianghao
    Liu, Zizheng
    Xu, Xiaozhong
    Liu, Shan
    Chen, Zhenzhong
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,