Deep convolutional features for image retrieval

被引:39
作者
Gkelios, Socratis [1 ]
Sophokleous, Aphrodite [2 ,3 ]
Plakias, Spiros [1 ]
Boutalis, Yiannis [1 ]
Chatzichristofis, Savvas A. [2 ,3 ]
机构
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Xanthi 67100, Greece
[2] Neapolis Univ Pafos, Intelligent Syst Lab, 2 Danais Ave, CY-8042 Pafos, Cyprus
[3] Neapolis Univ Pafos, Dept Comp Sci, 2 Danais Ave, CY-8042 Pafos, Cyprus
关键词
Image retrieval; Deep convolutional features; Deep learning; CNN; Global features; Local features; CBIR; CNN; CLASSIFICATION;
D O I
10.1016/j.eswa.2021.114940
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the use of Convolutional Neural Networks (CNNs) has led to tremendous achievements in several computer vision challenges. CNN-based image retrieval methods vary in complexity, growing capacity, and execution time. This work presents a state-of-the-art review in Deep Convolutional Features for image retrieval, pointing out their scope, advantages, and limitations. Moreover, the paper presents a procedure that adopts the latest architectures of pre-trained CNNs that have been initially proposed for image classification to shape image retrieval features. It investigates their suitability on several image retrieval tasks, without any optimization procedure, exhaustive preparatory work, and tuning. Each network's performance is evaluated in two different setups: one employing global and one using local representations. Extensive experiments on several well-known benchmark datasets demonstrate that a simple normalization on the pre-trained networks yields results comparable to state-of-the-art approaches. The global descriptor shapes a plug-and-play approach, which can be adopted for description and retrieval without any prior initialization or training. Moreover, the descriptor's localized version outperforms significantly much more sophisticated and complex methods of the recent literature.
引用
收藏
页数:17
相关论文
共 79 条
[1]  
[Anonymous], 2014, CoRR
[2]  
[Anonymous], 2012, MULTIMED TOOLS APPL
[3]  
[Anonymous], 2018, SIGNAL PROCESSING IM
[4]   Dynamic two-stage image retrieval from large multimedia databases [J].
Arampatzis, Avi ;
Zagoris, Konstantinos ;
Chatzichristofis, Savvas A. .
INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (01) :274-285
[5]  
Azizpour Hossein, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P36, DOI 10.1109/CVPRW.2015.7301270
[6]   Neural Codes for Image Retrieval [J].
Babenko, Artem ;
Slesarev, Anton ;
Chigorin, Alexandr ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599
[7]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[8]   Unifying Deep Local and Global Features for Image Search [J].
Cao, Bingyi ;
Araujo, Andre ;
Sim, Jack .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :726-743
[9]  
Chatzichristofis SA, 2008, LECT NOTES COMPUT SC, V5008, P312
[10]   Co.Vi.Wo.: Color Visual Words Based on Non-Predefined Size Codebooks [J].
Chatzichristofis, Savvas A. ;
Iakovidou, Chryssanthi ;
Boutalis, Yiannis ;
Marques, Oge .
IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (01) :192-205