Spatial-Semantic Image Search by Visual Feature Synthesis

被引:21
作者
Mai, Long [1 ]
Jin, Hailin [2 ]
Lin, Zhe [2 ]
Fang, Chen [2 ]
Brandt, Jonathan [2 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97207 USA
[2] Adobe Res, San Jose, CA USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
基金
美国国家科学基金会;
关键词
OF-THE-ART; RETRIEVAL;
D O I
10.1109/CVPR.2017.125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of image retrieval has been improved tremendously in recent years through the use of deep feature representations. Most existing methods, however, aim to retrieve images that are visually similar or semantically relevant to the query, irrespective of spatial configuration. In this paper, we develop a spatial-semantic image search technology that enables users to search for images with both semantic and spatial constraints by manipulating concept text-boxes on a 2D query canvas. We train a convolutional neural network to synthesize appropriate visual features that captures the spatial-semantic constraints from the user canvas query. We directly optimize the retrieval performance of the visual features when training our deep neural network. These visual features then are used to retrieve images that are both spatially and semantically relevant to the user query. The experiments on large-scale datasets such as MS-COCO and Visual Genome show that our method outperforms other baseline and state-of-the-art methods in spatial-semantic image search.
引用
收藏
页码:1121 / 1130
页数:10
相关论文
共 64 条
[1]  
[Anonymous], 2016, VISUAL GENOME CONNEC
[2]  
[Anonymous], 2016, EUR C COMP VIS
[3]  
[Anonymous], INT WORKSH SIM BAS P
[4]  
[Anonymous], P IEEE INT C COMP VI
[5]  
[Anonymous], 2009, Search Engines: Information Retrieval in Practice
[6]  
[Anonymous], 2014, PROC COMPUT VIS PATT
[7]  
[Anonymous], 2014, EUR C COMP VIS
[8]  
[Anonymous], P BRIT MACH VIS C
[9]  
[Anonymous], 2015, KDD15 P 21 ACM, DOI DOI 10.1145/2783258.2788621
[10]  
[Anonymous], P 32 INT C MACH LEAR