CodingHomo: Bootstrapping Deep Homography With Video Coding

被引:0
作者
Liu, Yike [1 ]
Li, Haipeng [1 ]
Liu, Shuaicheng [1 ]
Zeng, Bing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; deep homography; motion vector; image alignment; SEARCH ALGORITHM;
D O I
10.1109/TCSVT.2024.3418771
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing stateof-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: https://github.com/liuyike422/CodingHomo.
引用
收藏
页码:11214 / 11228
页数:15
相关论文
共 50 条
[41]   A New Approach to Video Coding Leveraging Hybrid Coding and Video Frame Interpolation [J].
Brascher, Andre Beims ;
da Silveira, Gabriela Furtado ;
Cancellier, Luiz Henrique ;
Seidel, Ismael ;
Grellert, Mateus ;
Guntzel, Jose Luis .
2023 36TH SBC/SBMICRO/IEEE/ACM SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI, 2023, :161-166
[42]   Mode Dependent Coding Tools for Video Coding [J].
Ma, Siwei ;
Wang, Shiqi ;
Yu, Qin ;
Si, Junjun ;
Gao, Wen .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (06) :990-1000
[43]   Estimation of video sequence homography based on a first-order estimation method [J].
Zheng, Qixuan ;
Li, Muyu ;
Yan, Hong .
PATTERN RECOGNITION, 2025, 158
[44]   Multiview-Video-Plus-Depth Coding Based on the Advanced Video Coding Standard [J].
Hannuksela, Miska M. ;
Rusanovskyy, Dmytro ;
Su, Wenyi ;
Chen, Lulu ;
Li, Ri ;
Aflaki, Payman ;
Lan, Deyan ;
Joachimiak, Michal ;
Li, Houqiang ;
Gabbouj, Moncef .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) :3449-3458
[45]   Video Coding Algorithm Based on High Efficiency Video Coding (HEVC) and Hybrid Transforms [J].
Wang, Chengyou ;
Shan, Rongyang ;
Zhou, Xiao .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09) :4448-4466
[46]   DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework [J].
Xue, Dongmei ;
Ma, Haichuan ;
Li, Li ;
Liu, Dong ;
Xiong, Zhiwei ;
Li, Houqiang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) :2922-2933
[47]   ADAPTIVE INTRA PERIOD SIZE FOR DEEP LEARNING-BASED SCREEN CONTENT VIDEO CODING [J].
Wu, Yuyang ;
Xie, Liang ;
Sun, Shangkun ;
Gao, Wei ;
Yan, Yiqiang .
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
[48]   Extended Coding Unit Partitioning for Future Video Coding [J].
Wang, Meng ;
Li, Junru ;
Zhang, Li ;
Zhang, Kai ;
Liu, Hongbin ;
Wang, Shiqi ;
Kwong, Sam ;
Ma, Siwei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :2931-2946
[49]   Quantization Matrix Coding for High Efficiency Video Coding [J].
Mo, Yijun ;
Xiong, Jiaji ;
Chen, Jianwen ;
Xu, Feng .
ADVANCES ON DIGITAL TELEVISION AND WIRELESS MULTIMEDIA COMMUNICATIONS, 2012, 331 :244-+
[50]   Isolated regions in video coding [J].
Hannuksela, MM ;
Wang, YK ;
Gabbouj, M .
IEEE TRANSACTIONS ON MULTIMEDIA, 2004, 6 (02) :259-267