Attention-driven Unsupervised Image Retrieval for Beauty Products with Visual and Textual Clues

被引:3
作者
Hou, Jingwen [1 ]
Ji, Sijie [1 ]
Wang, Annan [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
来源
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020年
关键词
image retrieval; attention mechanism; unsupervised learning;
D O I
10.1145/3394171.3416271
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Beauty and personal care product retrieval (BPCR) aims to match a query image of an item to examples of the same item in a large database. The task is extremely challenging because a small number of ground-truth examples have to be found in a large search space. Previous works mostly search only with visual representations and have not made full use of the product descriptions. Since many noisy examples only have subtle visual differences comparing to the ground-truth examples (e.g. similar packaging but different brands) and those differences (e.g. product brands) are especially hard to be captured only by visual features, methods merely based on visual feature similarities can easily regard those noisy examples as examples of the same item in the query image. We notice that the product descriptions are good sources for capturing those subtle visual differences. Therefore, we propose a search method utilizing both images and product descriptions in this work. Before searching, we not only prepare attention-based visual features for each database image but also a textual index (TI) that matches each database example to other examples with similar product descriptions. During searching, the visual feature of the query image is firstly searched in the whole database and then searched in a subset obtained by looking up the TI. Finally, the second result is used to refine the initial result. Since the subset examples usually have similar properties (e.g. brands and type), the noisy examples in the initial result can be effectively replaced. We have experimentally proved the effectiveness of the proposed method on the validation set of the Perfect-500K dataset. Our team (NTU-Beauty) achieved the 3rd place in the leader board of the Grand Challenge of AI Meets Beauty in ACM Multimedia 2020. Our code is available at: https://github.com/jingwenh/2020-ai-meetsbeauty_ntubeauty.git.
引用
收藏
页码:4718 / 4722
页数:5
相关论文
共 23 条
  • [1] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
  • [2] Bosch A, 2007, IEEE I CONF COMP VIS, P1863
  • [3] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [4] Fu Jianlong, 2020, PERFECT CORP CHALLEN
  • [5] Hou Jingwen, 2020, 2020 IEEE INT C IM P
  • [6] Hou Jingwen, 2020, P 28 ACM INT C MULT
  • [7] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [8] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [9] MS-RMAC: Multiscale Regional Maximum Activation of Convolutions for Image Retrieval
    Li, Yang
    Xu, Yulong
    Wang, Jiabao
    Miao, Zhuang
    Zhang, Yafei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (05) : 609 - 613
  • [10] Unprecedented Usage of Pre-trained CNNs on Beauty Product
    Lim, Jian Han
    Japar, Nurul
    Ng, Chun Chet
    Chan, Chee Seng
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2068 - 2072