Feature Point Detection and Description Networks Based on Asymmetric Convolution and the Cross-ResolutionImage-Matching Method

被引:0
作者
Zhang, Ruixing [1 ,2 ]
Yao, Tao [1 ,2 ]
Yan, Lianshan [1 ,2 ,3 ]
机构
[1] Ludong Univ, Sch Informat & Elect Engn, Yantai 264025, Peoples R China
[2] Southwest Jiaotong Univ, Yantai Res Inst New Generat Informat Technol, Yantai 264000, Peoples R China
[3] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610101, Peoples R China
基金
中国国家自然科学基金;
关键词
IMAGE REGISTRATION;
D O I
10.1155/2023/5131440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image matching can be transformed into the problem of feature point detection and matching of images. The current neural network methods have a weak detection effect on feature points and cannot extract enough sparse and uniform feature points. In order to improve the detection and description ability of feature points, this paper proposes a self-supervised feature point detection and description network based on asymmetric convolution: ACPoint. Specifically, first, feature point pseudolabels are learned from an unlabeled dataset, and pseudolabels are used for supervised learning; then, the learned model is used to update pseudolabels. Through multiple iterations of model training and label updating, high-quality labels and high-accuracy models are obtained adaptively. The asymmetric convolution feature point (ACPoint) network adopts an asymmetric convolution module to simultaneously train three convolution branches to learn more feature information, which uses two one-dimensional convolutions to enhance the backbone of square convolution from both horizontal and vertical directions and improve the representation of local features during inference. Based on the ACPoint network, a cross-resolution image-matching method is proposed. Experiments show that our proposed network model has higher localization accuracy and homography estimation ability on the HPatches dataset.
引用
收藏
页数:15
相关论文
共 43 条
[21]   Online Multi-modal Hashing with Dynamic Query-adaption [J].
Lu, Xu ;
Zhu, Lei ;
Cheng, Zhiyong ;
Nie, Liqiang ;
Zhang, Huaxiang .
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :715-724
[22]   Image Matching from Handcrafted to Deep Features: A Survey [J].
Ma, Jiayi ;
Jiang, Xingyu ;
Fan, Aoxiang ;
Jiang, Junjun ;
Yan, Junchi .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (01) :23-79
[23]   ORB-SLAM: A Versatile and Accurate Monocular SLAM System [J].
Mur-Artal, Raul ;
Montiel, J. M. M. ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (05) :1147-1163
[24]   SIFT: predicting amino acid changes that affect protein function [J].
Ng, PC ;
Henikoff, S .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3812-3814
[25]   A view-free image stitching network based on global homography [J].
Nie, Lang ;
Lin, Chunyu ;
Liao, Kang ;
Liu, Meiqin ;
Zhao, Yao .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
[26]  
Pal P., 2021, DFT21 DISCRETE FOURI
[27]  
Rocco I, 2018, Arxiv, DOI arXiv:1810.10510
[28]   Machine learning for high-speed corner detection [J].
Rosten, Edward ;
Drummond, Tom .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :430-443
[29]   SuperGlue: Learning Feature Matching with Graph Neural Networks [J].
Sarlin, Paul-Edouard ;
DeTone, Daniel ;
Malisiewicz, Tomasz ;
Rabinovich, Andrew .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4937-4946
[30]   Discriminative Learning of Deep Convolutional Feature Point Descriptors [J].
Simo-Serra, Edgar ;
Trulls, Eduard ;
Ferraz, Luis ;
Kokkinos, Iasonas ;
Fua, Pascal ;
Moreno-Noguer, Francesc .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :118-126