CodingHomo: Bootstrapping Deep Homography With Video Coding

被引：0

作者：

Liu, Yike ^{[1
]}

Li, Haipeng ^{[1
]}

Liu, Shuaicheng ^{[1
]}

Zeng, Bing ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;

D O I：

10.1109/TCSVT.2024.3418771

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.

引用

页码：11214 / 11228

页数：15

共 50 条

[31] Role of Nanotechnology in Lossless Video Compression Coding
Loukil, H.
Abbas, M.
Algahtani, Ali
Kessentini, A.
Muneer, P.
Ijyas, Thafasal
Wase, M. Abdul
NANOSCIENCE AND NANOTECHNOLOGY LETTERS, 2019, 11 (12) : 1617 - 1632
[32] Efficient Motion Estimation Algorithms for Video Coding
Wu, Ming-Te
ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2): : 953 - 956
[33] TENSOR VIDEO CODING
Mahfoodh, Abo Talib
Radha, Hayder
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 1724 - 1728
[34] Video Coding for Machine
Gao, Wen
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1 - 1
[35] The Future of Video Coding
Ling, Nam
Kuo, C. -C. Jay
Sullivan, Gary J.
Xu, Dong
Liu, Shan
Hang, Hsueh-Ming
Peng, Wen-Hsiao
Liu, Jiaying
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
[36] Distributed video coding
Girod, B
Margot, A
Rane, S
Rebollo-Monedero, D
PROCEEDINGS OF THE IEEE, 2005, 93 (01) : 71 - 83
[37] Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation
Zhao, Lei
Wang, Shiqi
Zhang, Xinfeng
Wang, Shanshe
Ma, Siwei
Gao, Wen
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4832 - 4844
[38] Deep Video Prediction Network-ased Inter-Frame Coding in HEVC
Lee, Jung-Kyung
Kim, Nayoung
Cho, Seunghyun
Kang, Je-Won
IEEE ACCESS, 2020, 8 : 95906 - 95917
[39] Low-Complexity Error Resilient HEVC Video Coding: A Deep Learning Approach
Wang, Taiyu
Li, Fan
Qiao, Xiaoya
Cosman, Pamela C.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1245 - 1260
[40] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
Jia, Jianghao
Liu, Zizheng
Xu, Xiaozhong
Liu, Shan
Chen, Zhenzhong
2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,

← 1 2 3 4 5 →