CDbin: Compact Discriminative Binary Descriptor Learned With Efficient Neural Network

被引：20

作者：

Ye, Jianming ^{[1
]}

Zhang, Shiliang ^{[1
]}

Huang, Tiejun ^{[1
]}

Rui, Yong ^{[2
]}

机构：

[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China

[2] Lenovo Grp Ltd, Beijing 100871, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020年 / 30卷 / 03期

基金：

北京市自然科学基金;

关键词：

Feature extraction; Binary codes; Training; Quantization (signal); Neural networks; Task analysis; Correlation; Image patch matching; binary descriptor; convolutional neural networks;

D O I：

10.1109/TCSVT.2019.2896095

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As an important computer vision task, image matching requires efficient and discriminative local descriptors. Most of the existing descriptors like SIFT and ORB are hand-crafted; therefore it is necessary to study more optimized descriptors through end-to-end learning. This paper proposes the compact binary descriptors learned with a lightweight Convolutional Neural Network (CNN), which is efficient for training and testing. Specifically, we propose a CNN with no larger than five layers for descriptor learning. The resulting descriptors, i.e., Compact Discriminative binary descriptors (CDbin) are optimized with four complementary loss functions, i.e., 1) triplet loss to ensure the discriminative power; 2) quantization loss to decrease the quantization error; 3) correlation loss to ensure the feature compactness; and 4) even-distribution loss to enrich the embedded information. The extensive experiments on two image patch datasets and three image retrieval datasets show that the CDbin exhibits competitive performance compared with the existing descriptors. For example, the 64-bit CDbin substantially outperforms the 256-bit ORB and 1024-bit SIFT on Hpatches dataset. Although generated by a shallow CNN, CDbin also outperforms several recent deep descriptors.

引用

页码：862 / 874

页数：13

共 45 条

[1] Learning to Match Aerial Images with Deep Attentive Architectures [J].

Altwaijry, Hani ;

Trulls, Eduard ;

Hays, James ;

Fua, Pascal ;

Belongie, Serge .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3539-3547

[2]

[Anonymous], 2016, P BRIT MACH VIS C

[3]

[Anonymous], ADV NEURAL INFORM PR

[4]

Arandjelovic R, 2012, PROC CVPR IEEE, P2911, DOI 10.1109/CVPR.2012.6248018

[5] HPatches: A benchmark and evaluation of handcrafted and learned local descriptors [J].

Balntas, Vassileios ;

Lenc, Karel ;

Vedaldi, Andrea ;

Mikolajczyk, Krystian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3852-3861

[6]

Balntas V, 2015, PROC CVPR IEEE, P2367, DOI 10.1109/CVPR.2015.7298850

[7] SURF: Speeded up robust features [J].

Bay, Herbert ;

Tuytelaars, Tinne ;

Van Gool, Luc .

COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417

[8] Unsupervised 3D object recognition and reconstruction in unordered datasets [J].

Brown, M ;

Lowe, DG .

FIFTH INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 2005, :56-+

[9] Discriminative Learning of Local Image Descriptors [J].

Brown, Matthew ;

Hua, Gang ;

Winder, Simon .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) :43-57

[10] BRIEF: Binary Robust Independent Elementary Features [J].

Calonder, Michael ;

Lepetit, Vincent ;

Strecha, Christoph ;

Fua, Pascal .

COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :778-792

← 1 2 3 4 5 →