Query Adaptive Multiview Object Instance Search and Localization Using Sketches

被引:11
作者
Das Bhattacharjee, Sreyasee [1 ]
Yuan, Junsong [2 ]
Huang, Yicheng [3 ]
Meng, Jingjing [2 ]
Duan, Lingyu [3 ]
机构
[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Sketch Based Search; object localization; object recognition; object retrieval; multi-view proposal selection; transductive clustering; IMAGE RETRIEVAL; RECOGNITION; HISTOGRAM;
D O I
10.1109/TMM.2018.2814338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketch-based object search is a challenging problem mainly due to three difficulties: 1) how to match the primary sketch query with the colorful image; 2) how to locate the small object in a big image that is similar to the sketch query; and 3) given the large image database, how to ensure an efficient search scheme that is reasonably scalable. To address the above challenges, we propose leveraging object proposals for object search and localization. However, instead of purely relying on sketch features, we propose fully utilizing the appearance features of object proposals to resolve the ambiguities between the matching sketch query and object proposals. Our proposed query adaptive search is formulated as a subgraph selection problem, which can be solved by the maximum flow algorithm. By performing query expansion, it can accurately locate the small target objects in a cluttered background or densely drawn deformation-intensive cartoon (Manga like) images. To improve the computing efficiency of matching proposal candidates, the proposed Multi View Spatially Constrained Proposal Selection encodes each identified object proposal in terms of a small local basis of anchor objects. The results on benchmark datasets validate the advantages of utilizing both the sketch and appearance features for sketch-based search, while ensuring sufficient scalability at the same time.
引用
收藏
页码:2761 / 2773
页数:13
相关论文
共 61 条
[41]  
Qian Y., 2015, P BRIT MACH VIS C
[42]  
Qu Y., 2008, P SIGGRAPH ACM T GRA
[43]   Color descriptor for content-based drawing retrieval [J].
Rigaud, Christophe ;
Burie, Jean-Christophe ;
Ogier, Jean-Marc ;
Karatzas, Dimosthenis .
2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, :267-271
[44]   An active contour model for speech balloon detection in comics [J].
Rigaud, Christophe ;
Burie, Jean-Christophe ;
Ogier, Jean-Marc ;
Karatzas, Dimosthenis ;
Van de Weijer, Joost .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1240-1244
[45]  
Saavedra J.M., 2015, P BRIT MACH VIS C 20
[46]   RST-SHELO: sketch-based image retrieval using sketch tokens and square root normalization [J].
Saavedra, Jose M. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (01) :931-951
[47]  
Saavedra JM, 2014, IEEE IMAGE PROC, P2998, DOI 10.1109/ICIP.2014.7025606
[48]   Sketch-based image retrieval using keyshapes [J].
Saavedra, Jose M. ;
Bustos, Benjamin .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (03) :2033-2062
[49]  
Saavedra JM, 2010, LECT NOTES COMPUT SC, V6376, P432
[50]   The Sketchy Database: Learning to Retrieve Badly Drawn Bunnies [J].
Sangkloy, Patsorn ;
Burnell, Nathan ;
Ham, Cusuh ;
Hays, James .
ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04)