Powering Robust Fashion Retrieval with Information Rich Feature Embeddings

被引:8
作者
Chopra, Ayush [1 ]
Sinha, Abhishek [1 ]
Gupta, Hiresh [1 ]
Sarkar, Mausoom [1 ]
KumarAyush [1 ]
Balaji, K. [1 ]
机构
[1] Adobe Inc, Media & Data Sci Res, San Jose, CA 95110 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019) | 2019年
关键词
D O I
10.1109/CVPRW.2019.00045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual content-based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a challenging problem owing to a wide range of visual distortions in their product images. In this paper, we propose a Grid Search Network (GSN) for learning feature embeddings for fashion retrieval. The proposed approach posits the training procedure as a search problem, focused on locating matches for a reference query image in a grid containing both positive and negative images w.rt the query. The proposed framework significantly outperforms existing state-of-the-art methods on benchmark fashion datasets. We also utilize a reinforcement learning based strategy to learn a specialized transformation function which further improves retrieval performance when applied over the feature embeddings. We also extend the reinforcement learning based strategy to learn custom kernel functions for SVM based classification over FashionMNIST and MNIST datasets, showing improved performance. We highlight the generalization capabilities of this search strategy by showing performance improvement in domains beyond fashion.
引用
收藏
页码:326 / 334
页数:9
相关论文
共 22 条
[1]  
[Anonymous], CORR
[2]  
[Anonymous], 2017, ARXIV170907417
[3]  
Brain Google., 2018, 6 INT C LEARNING REP
[4]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[5]  
Collobert R., 2008, Proceedings of the 25th international conference on Machine learning, V25, P160, DOI DOI 10.1145/1390156.1390177
[6]   Fundamental Technologies in Modern Speech Recognition [J].
Furui, Sadaoki ;
Deng, Li ;
Gales, Mark ;
Ney, Hermann ;
Tokuda, Keiichi .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :16-17
[7]   Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network [J].
Huang, Junshi ;
Feris, Rogerio ;
Chen, Qiang ;
Yan, Shuicheng .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1062-1070
[8]   Where to Buy It: Matching Street Clothing Photos in Online Shops [J].
Kiapour, M. Hadi ;
Han, Xufeng ;
Lazebnik, Svetlana ;
Berg, Alexander C. ;
Berg, Tamara L. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3343-3351
[9]   3D Object Representations for Fine-Grained Categorization [J].
Krause, Jonathan ;
Stark, Michael ;
Deng, Jia ;
Li Fei-Fei .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90