Attention-driven Unsupervised Image Retrieval for Beauty Products with Visual and Textual Clues

被引：3

作者：

Hou, Jingwen ^{[1
]}

Ji, Sijie ^{[1
]}

Wang, Annan ^{[1
]}

机构：

[1] Nanyang Technol Univ, Singapore, Singapore

来源：

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020年

关键词：

image retrieval; attention mechanism; unsupervised learning;

D O I：

10.1145/3394171.3416271

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Beauty and personal care product retrieval (BPCR) aims to match a query image of an item to examples of the same item in a large database. The task is extremely challenging because a small number of ground-truth examples have to be found in a large search space. Previous works mostly search only with visual representations and have not made full use of the product descriptions. Since many noisy examples only have subtle visual differences comparing to the ground-truth examples (e.g. similar packaging but different brands) and those differences (e.g. product brands) are especially hard to be captured only by visual features, methods merely based on visual feature similarities can easily regard those noisy examples as examples of the same item in the query image. We notice that the product descriptions are good sources for capturing those subtle visual differences. Therefore, we propose a search method utilizing both images and product descriptions in this work. Before searching, we not only prepare attention-based visual features for each database image but also a textual index (TI) that matches each database example to other examples with similar product descriptions. During searching, the visual feature of the query image is firstly searched in the whole database and then searched in a subset obtained by looking up the TI. Finally, the second result is used to refine the initial result. Since the subset examples usually have similar properties (e.g. brands and type), the noisy examples in the initial result can be effectively replaced. We have experimentally proved the effectiveness of the proposed method on the validation set of the Perfect-500K dataset. Our team (NTU-Beauty) achieved the 3rd place in the leader board of the Grand Challenge of AI Meets Beauty in ACM Multimedia 2020. Our code is available at: https://github.com/jingwenh/2020-ai-meetsbeauty_ntubeauty.git.

引用

页码：4718 / 4722

页数：5

共 23 条

[1] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[2] Bosch A, 2007, IEEE I CONF COMP VIS, P1863
[3] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[4] Fu Jianlong, 2020, PERFECT CORP CHALLEN
[5] Hou Jingwen, 2020, 2020 IEEE INT C IM P
[6] Hou Jingwen, 2020, P 28 ACM INT C MULT
[7] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[8] Densely Connected Convolutional Networks
Huang, Gao
Liu, Zhuang
van der Maaten, Laurens
Weinberger, Kilian Q.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
[9] MS-RMAC: Multiscale Regional Maximum Activation of Convolutions for Image Retrieval
Li, Yang
Xu, Yulong
Wang, Jiabao
Miao, Zhuang
Zhang, Yafei
[J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (05) : 609 - 613
[10] Unprecedented Usage of Pre-trained CNNs on Beauty Product
Lim, Jian Han
Japar, Nurul
Ng, Chun Chet
Chan, Chee Seng
[J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2068 - 2072

← 1 2 3 →