Attention-driven Unsupervised Image Retrieval for Beauty Products with Visual and Textual Clues

被引：3

作者：

Hou, Jingwen ^{[1
]}

Ji, Sijie ^{[1
]}

Wang, Annan ^{[1
]}

机构：

[1] Nanyang Technol Univ, Singapore, Singapore

来源：

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020年

关键词：

image retrieval; attention mechanism; unsupervised learning;

D O I：

10.1145/3394171.3416271

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Beauty and personal care product retrieval (BPCR) aims to match a query image of an item to examples of the same item in a large database. The task is extremely challenging because a small number of ground-truth examples have to be found in a large search space. Previous works mostly search only with visual representations and have not made full use of the product descriptions. Since many noisy examples only have subtle visual differences comparing to the ground-truth examples (e.g. similar packaging but different brands) and those differences (e.g. product brands) are especially hard to be captured only by visual features, methods merely based on visual feature similarities can easily regard those noisy examples as examples of the same item in the query image. We notice that the product descriptions are good sources for capturing those subtle visual differences. Therefore, we propose a search method utilizing both images and product descriptions in this work. Before searching, we not only prepare attention-based visual features for each database image but also a textual index (TI) that matches each database example to other examples with similar product descriptions. During searching, the visual feature of the query image is firstly searched in the whole database and then searched in a subset obtained by looking up the TI. Finally, the second result is used to refine the initial result. Since the subset examples usually have similar properties (e.g. brands and type), the noisy examples in the initial result can be effectively replaced. We have experimentally proved the effectiveness of the proposed method on the validation set of the Perfect-500K dataset. Our team (NTU-Beauty) achieved the 3rd place in the leader board of the Grand Challenge of AI Meets Beauty in ACM Multimedia 2020. Our code is available at: https://github.com/jingwenh/2020-ai-meetsbeauty_ntubeauty.git.

引用

页码：4718 / 4722

页数：5

共 23 条

[11] Cross-domain Beauty Item Retrieval via Unsupervised Embedding Learning
Lin, Zehang
Xie, Haoran
Kang, Peipei
Yang, Zhenguo
Liu, Wenyin
Li, Qing
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2543 - 2547
[12] Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product Retrieval
Lin, Zehang
Yang, Zhenguo
Huang, Feitao
Chen, Junhong
[J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2073 - 2077
[13] Scale & affine invariant interest point detectors
Mikolajczyk, K
Schmid, C
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (01) : 63 - 86
[14] SharifRazavian A., 2014, Proceedings of the IEEE conference on computer vision and pattern recognition workshops, P806, DOI [DOI 10.1109/CVPRW.2014.131, 10.1109/cvprw.2014.131]
[15] Simultaneous Feature Aggregating and Hashing for Compact Binary Code Learning
Thanh-Toan Do
Khoa Le
Tuan Hoang
Huu Le
Nguyen, Tam, V
Ngai-Man Cheung
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4954 - 4969
[16] Tolias G., 2015, INT C LEARN REPR
[17] The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval
Wang, Jiawei
Zhu, Shuai
Xu, Jiao
Cao, Da
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2548 - 2552
[18] Beauty Product Image Retrieval Based on Multi-Feature Fusion and Feature Aggregation
Wang, Qi
Lai, Jingxiang
Xu, Kai
Liu, Wenyin
Lei, Liang
[J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2063 - 2067
[19] Polar Embedding for Aurora Image Retrieval
Yang, Xi
Gao, Xinbo
Tian, Qi
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 3332 - 3344
[20] Beauty Product Retrieval Based on Regional Maximum Activation of Convolutions with Generalized Attention
Yu, Jun
Xie, Guochen
Li, Mengyan
Xie, Haonian
Yu, Lingyun
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2553 - 2557

← 1 2 3 →