SBIR-BYOL: a self-supervised sketch-based image retrieval model

被引:2
作者
Saavedra, Jose M. [1 ]
Morales, Javier [2 ]
Murrugarra-Llerena, Nils [3 ]
机构
[1] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Santiago 7620001, RM, Chile
[2] Univ Chile, Dept Comp Sci, Av Blanco Encalada 2120, Santiago 8370459, RM, Chile
[3] Weber State Univ, Sch Comp, 3848 Harrison Blvd, Ogden, UT 84408 USA
关键词
Sketch-based image retrieval; Self-supervision; Deep-learning; Representation learning; REPRESENTATIONS;
D O I
10.1007/s00521-022-07978-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based image retrieval is demanding interest in the computer vision community due to its relevance in the visual perception system and its potential application in a wide diversity of industries. In the literature, we observe significant advances when the models are evaluated in public datasets. However, when assessed in real environments, the performance drops drastically. The big problem is that the SOTA SBIR models follow a supervised regimen, strongly depending on a considerable amount of labeled sketch-photo pairs, which is unfeasible in real contexts. Therefore, we propose SBIR-BYOL, an extension of the well-known BYOL, to work in a bimodal scenario for sketch-based image retrieval. To this end, we also propose a two-stage self-supervised training methodology, exploiting existing sketch-photo pairs and contour-photo pairs generated from photographs of a target catalog. We demonstrate the benefits of our model for the eCommerce environments, where searching is a critical component. Here, our self-supervised SBIR model shows an increase of over 60% of mAP.
引用
收藏
页码:5395 / 5408
页数:14
相关论文
共 39 条
  • [1] Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression
    Bui, Tu
    Ribeiro, Leonardo
    Ponti, Moacir
    Collomosse, John
    [J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 77 - 87
  • [3] SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
    Chen, Wengling
    Hays, James
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9416 - 9425
  • [4] Storyboard sketches for content based video retrieval
    Collomosse, J. P.
    McNeill, G.
    Qian, Y.
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 245 - 252
  • [5] Drawing as a Space for Social-Cognitive Interaction
    De Andrade, Vanessa
    Freire, Sofia
    Baptista, Monica
    Shwartz, Yael
    [J]. EDUCATION SCIENCES, 2022, 12 (01):
  • [6] How Do Humans Sketch Objects?
    Eitz, Mathias
    Hays, James
    Alexa, Marc
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [7] The Surprisingly Powerful Influence of Drawing on Memory
    Fernandes, Myra A.
    Wammes, Jeffrey D.
    Meade, Melissa E.
    [J]. CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2018, 27 (05) : 302 - 308
  • [8] CogSketch: Sketch Understanding for Cognitive Science Research and for Education
    Forbus, Kenneth
    Usher, Jeffrey
    Lovett, Andrew
    Lockwood, Kate
    Wetzel, Jon
    [J]. TOPICS IN COGNITIVE SCIENCE, 2011, 3 (04) : 648 - 666
  • [9] Sketch-QNet: A Quadruplet ConvNet for Color Sketch-based Image Retrieval
    Fuentes, Anibal
    Saavedra, Jose M.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2134 - 2141
  • [10] Grill J.-B., 2020, P ADV NEUR INF PROC, V33, P21271, DOI DOI 10.48550/ARXIV.2006.07733