SBIR-BYOL: a self-supervised sketch-based image retrieval model

被引：2

作者：

Saavedra, Jose M. ^{[1
]}

Morales, Javier ^{[2
]}

Murrugarra-Llerena, Nils ^{[3
]}

机构：

[1] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Santiago 7620001, RM, Chile

[2] Univ Chile, Dept Comp Sci, Av Blanco Encalada 2120, Santiago 8370459, RM, Chile

[3] Weber State Univ, Sch Comp, 3848 Harrison Blvd, Ogden, UT 84408 USA

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 07期

关键词：

Sketch-based image retrieval; Self-supervision; Deep-learning; Representation learning; REPRESENTATIONS;

D O I：

10.1007/s00521-022-07978-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sketch-based image retrieval is demanding interest in the computer vision community due to its relevance in the visual perception system and its potential application in a wide diversity of industries. In the literature, we observe significant advances when the models are evaluated in public datasets. However, when assessed in real environments, the performance drops drastically. The big problem is that the SOTA SBIR models follow a supervised regimen, strongly depending on a considerable amount of labeled sketch-photo pairs, which is unfeasible in real contexts. Therefore, we propose SBIR-BYOL, an extension of the well-known BYOL, to work in a bimodal scenario for sketch-based image retrieval. To this end, we also propose a two-stage self-supervised training methodology, exploiting existing sketch-photo pairs and contour-photo pairs generated from photographs of a target catalog. We demonstrate the benefits of our model for the eCommerce environments, where searching is a critical component. Here, our self-supervised SBIR model shows an increase of over 60% of mAP.

引用

页码：5395 / 5408

页数：14

共 39 条

[1] Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression
Bui, Tu
Ribeiro, Leonardo
Ponti, Moacir
Collomosse, John
[J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 77 - 87
[2] A COMPUTATIONAL APPROACH TO EDGE-DETECTION
CANNY, J
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) : 679 - 698
[3] SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
Chen, Wengling
Hays, James
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9416 - 9425
[4] Storyboard sketches for content based video retrieval
Collomosse, J. P.
McNeill, G.
Qian, Y.
[J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 245 - 252
[5] Drawing as a Space for Social-Cognitive Interaction
De Andrade, Vanessa
Freire, Sofia
Baptista, Monica
Shwartz, Yael
[J]. EDUCATION SCIENCES, 2022, 12 (01):
[6] How Do Humans Sketch Objects?
Eitz, Mathias
Hays, James
Alexa, Marc
[J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
[7] The Surprisingly Powerful Influence of Drawing on Memory
Fernandes, Myra A.
Wammes, Jeffrey D.
Meade, Melissa E.
[J]. CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2018, 27 (05) : 302 - 308
[8] CogSketch: Sketch Understanding for Cognitive Science Research and for Education
Forbus, Kenneth
Usher, Jeffrey
Lovett, Andrew
Lockwood, Kate
Wetzel, Jon
[J]. TOPICS IN COGNITIVE SCIENCE, 2011, 3 (04) : 648 - 666
[9] Sketch-QNet: A Quadruplet ConvNet for Color Sketch-based Image Retrieval
Fuentes, Anibal
Saavedra, Jose M.
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2134 - 2141
[10] Grill J.-B., 2020, P ADV NEUR INF PROC, V33, P21271, DOI DOI 10.48550/ARXIV.2006.07733

← 1 2 3 4 →