Attention-driven Unsupervised Image Retrieval for Beauty Products with Visual and Textual Clues

被引:3
作者
Hou, Jingwen [1 ]
Ji, Sijie [1 ]
Wang, Annan [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
来源
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020年
关键词
image retrieval; attention mechanism; unsupervised learning;
D O I
10.1145/3394171.3416271
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Beauty and personal care product retrieval (BPCR) aims to match a query image of an item to examples of the same item in a large database. The task is extremely challenging because a small number of ground-truth examples have to be found in a large search space. Previous works mostly search only with visual representations and have not made full use of the product descriptions. Since many noisy examples only have subtle visual differences comparing to the ground-truth examples (e.g. similar packaging but different brands) and those differences (e.g. product brands) are especially hard to be captured only by visual features, methods merely based on visual feature similarities can easily regard those noisy examples as examples of the same item in the query image. We notice that the product descriptions are good sources for capturing those subtle visual differences. Therefore, we propose a search method utilizing both images and product descriptions in this work. Before searching, we not only prepare attention-based visual features for each database image but also a textual index (TI) that matches each database example to other examples with similar product descriptions. During searching, the visual feature of the query image is firstly searched in the whole database and then searched in a subset obtained by looking up the TI. Finally, the second result is used to refine the initial result. Since the subset examples usually have similar properties (e.g. brands and type), the noisy examples in the initial result can be effectively replaced. We have experimentally proved the effectiveness of the proposed method on the validation set of the Perfect-500K dataset. Our team (NTU-Beauty) achieved the 3rd place in the leader board of the Grand Challenge of AI Meets Beauty in ACM Multimedia 2020. Our code is available at: https://github.com/jingwenh/2020-ai-meetsbeauty_ntubeauty.git.
引用
收藏
页码:4718 / 4722
页数:5
相关论文
共 23 条
  • [11] Cross-domain Beauty Item Retrieval via Unsupervised Embedding Learning
    Lin, Zehang
    Xie, Haoran
    Kang, Peipei
    Yang, Zhenguo
    Liu, Wenyin
    Li, Qing
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2543 - 2547
  • [12] Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product Retrieval
    Lin, Zehang
    Yang, Zhenguo
    Huang, Feitao
    Chen, Junhong
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2073 - 2077
  • [13] Scale & affine invariant interest point detectors
    Mikolajczyk, K
    Schmid, C
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (01) : 63 - 86
  • [14] SharifRazavian A., 2014, Proceedings of the IEEE conference on computer vision and pattern recognition workshops, P806, DOI [DOI 10.1109/CVPRW.2014.131, 10.1109/cvprw.2014.131]
  • [15] Simultaneous Feature Aggregating and Hashing for Compact Binary Code Learning
    Thanh-Toan Do
    Khoa Le
    Tuan Hoang
    Huu Le
    Nguyen, Tam, V
    Ngai-Man Cheung
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4954 - 4969
  • [16] Tolias G., 2015, INT C LEARN REPR
  • [17] The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval
    Wang, Jiawei
    Zhu, Shuai
    Xu, Jiao
    Cao, Da
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2548 - 2552
  • [18] Beauty Product Image Retrieval Based on Multi-Feature Fusion and Feature Aggregation
    Wang, Qi
    Lai, Jingxiang
    Xu, Kai
    Liu, Wenyin
    Lei, Liang
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2063 - 2067
  • [19] Polar Embedding for Aurora Image Retrieval
    Yang, Xi
    Gao, Xinbo
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 3332 - 3344
  • [20] Beauty Product Retrieval Based on Regional Maximum Activation of Convolutions with Generalized Attention
    Yu, Jun
    Xie, Guochen
    Li, Mengyan
    Xie, Haonian
    Yu, Lingyun
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2553 - 2557