BIG-OH: BInarization of gradient orientation histograms

被引:16
作者
Baber, Junaid [1 ,2 ]
Dailey, Matthew N. [2 ]
Satoh, Shin'ichi [3 ]
Afzulpurkar, Nitin [2 ]
Bakhtyar, Maheen [1 ]
机构
[1] Univ Balochistan, Dept Comp Sci & Informat Technol, Quetta, Pakistan
[2] Asian Inst Technol, Sch Engn & Technol, Pathum Thani 12120, Thailand
[3] Natl Inst Informat, Tokyo, Japan
关键词
Gradient orientation histograms; SIFT; Gradient based keypoint descriptors; Keypoint descriptor quantization; SCALE;
D O I
10.1016/j.imavis.2014.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting local keypoints and keypoint descriptions from images is a primary step for many computer vision and image retrieval applications. In the literature, many researchers have proposed methods for representing local texture around keypoints with varying levels of robustness to photometric and geometric transformations. Gradient-based descriptors such as the Scale Invariant Feature Transform (SIFT) are among the most consistent and robust descriptors. The SIFT descriptor, a 128-element vector consisting of multiple gradient histograms computed from local image patches around a keypoint, is widely considered as the gold standard keypoint descriptor. However, SIFT descriptors require at least 128 bytes of storage per descriptor. Since images are typically described by thousands of keypoints, it may require more space to store the SIFT descriptors for an image than the original image itself. This may be prohibitive in extremely large-scale applications and applications on memory-constrained devices such as tablets and smartphones. In this paper, with the goal of reducing the memory requirements of keypoint descriptors such as SIFT, without affecting their performance, we propose BIG-OH, a simple yet extremely effective method for binary quantization of any descriptor based on gradient orientation histograms. BIG-OH's memory requirements are very small when it uses SIFT's default parameters for the construction of the gradient orientation histograms, it only requires 16 bytes per descriptor. BIG-OH quantizes gradient orientation histograms by computing a bit vector representing the relative magnitudes of local gradients associated with neighboring orientation bins. In a series of experiments on keypoint matching with different types of keypoint detectors under various photometric and geometric transformations, we find that the quantized descriptor has performance comparable to or better than other descriptors, including BRISK, CARD, BRIEF, D-BRIEF, SQ, and PCA-SIFT. Our experiments also show that BIG-OH is extremely effective for image retrieval, with modestly better performance than SIFT. BIG-OH's drastic reduction in memory requirements, obtained while preserving or improving the image matching and image retrieval performance of SIFT, makes it an excellent descriptor for large image databases and applications running on memory-constrained devices. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:940 / 953
页数:14
相关论文
共 46 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
Ambai M., 2011, INT C COMP VIS ICCV
[3]  
[Anonymous], 2012, P 20 ACM MULTIMEDIA
[4]  
Baber J., 2011, 2011 7th International Conference on Emerging Technologies, ICET 2011, P1
[5]  
Baber J., 2011, INT C DIG SIGN PROC
[6]   A FRAMEWORK FOR VIDEO SEGMENTATION USING GLOBAL AND LOCAL FEATURES [J].
Baber, Junaid ;
Afzulpurkar, Nitin ;
Satoh, Shin'ichi .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (05)
[7]  
Baumberg A, 2000, PROC CVPR IEEE, P774, DOI 10.1109/CVPR.2000.855899
[8]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[9]   In defense of Nearest-Neighbor based image classification [J].
Boiman, Oren ;
Shechtman, Eli ;
Irani, Michal .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1992-+
[10]   BRIEF: Binary Robust Independent Elementary Features [J].
Calonder, Michael ;
Lepetit, Vincent ;
Strecha, Christoph ;
Fua, Pascal .
COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :778-792