Learning Neural Bag-of-Features for Large-Scale Image Retrieval

被引:16
作者
Passalis, Nikolaos [1 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2017年 / 47卷 / 10期
关键词
Bag-of-features (BoFs) representation; information retrieval; neural networks; retrieval-oriented optimization; FACE RECOGNITION; INFORMATION; MODEL;
D O I
10.1109/TSMC.2017.2680404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the well-known bag-of-features (BoFs) model is generalized and formulated as a neural network that is composed of three layers: 1) a radial basis function (RBF) layer; 2) an accumulation layer; and 3) a fully connected layer. This formulation allows for decoupling the representation size from the number of used codewords, as well as for better modeling the feature distribution using a separate trainable scaling parameter for each RBF neuron. The resulting network, called retrievaloriented neural BoF (RN-BoF), is trained using regular back propagation and allows for fast extraction of compact image representations. It is demonstrated that the RN-BoF model is capable of: 1) increasing the object encoding and retrieval speed; 2) reducing the extracted representation size; and 3) increasing the retrieval precision. A symmetry-aware spatial segmentation technique is also proposed to further reduce the encoding time and the storage requirements and allows the method to efficiently scale to large datasets. The proposed method is evaluated and compared to other state-of-the-art techniques using five different image datasets, including the large-scale YouTube Faces database.
引用
收藏
页码:2641 / 2652
页数:12
相关论文
共 54 条
[1]  
[Anonymous], 2013, International Conference on Machine Learning
[2]  
[Anonymous], P INNS C BIG DAT
[3]  
[Anonymous], P 9 HELL C ART INT S
[4]  
[Anonymous], J ELECT IMAGING
[5]  
[Anonymous], IEEE T KNOWL DATA EN
[6]  
[Anonymous], 2008, Int. Symp. on Music Information Retrieval (ISMIR)
[7]  
[Anonymous], ADV NEURAL INF PROCE
[8]  
[Anonymous], THEAN PYTH FRAM FAST
[9]  
[Anonymous], 2008, REAL LIF IM WORKSH E
[10]  
[Anonymous], 2008, INTRO INFORM RETRIEV, DOI DOI 10.1017/CBO9780511809071