PRODUCT IMAGE REPRESENTATION LEARNING ON LARGE SCALE NOISY DATASETS

被引:0
|
作者
Joshi, Aniket
Das, Nilotpal
Yenigalla, Promod
机构
关键词
D O I
10.1109/ICIP49359.2023.10222339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning product similarity using distance metric learning from real world catalog needs to take care of large number of product categories and noisy labels. On one hand, large number of product categories makes online hard mining (OHM) less effective as hard triplets become sparse and thus difficult to find. On the other hand, the validity of the hard-triplets themselves is less certain in the case of noisy labelled training data. In this paper, we address the problem of large-scale product representation learning in the presence of noisy training data. To address these challenges, we propose a novel co-teaching based label correction scheme for distance metric learning, that is motivated by the inconsistencies of variations relationships in the product catalog. To validate our approach, we conducted experiments on 20 different product categories, where we achieve up to 4% improvement in PR-AUC compared to the SOTA baseline and conclude by discussing the durable learnings we gained from these experiments and directions for future research.
引用
收藏
页码:2570 / 2574
页数:5
相关论文
共 50 条
  • [1] Harnessing Large-Scale Herbarium Image Datasets Through Representation Learning
    Walker, Barnaby E.
    Tucker, Allan
    Nicolson, Nicky
    FRONTIERS IN PLANT SCIENCE, 2022, 12
  • [2] Learning From Noisy Large-Scale Datasets With Minimal Supervision
    Veit, Andreas
    Alldrin, Neil
    Chechik, Gal
    Krasin, Ivan
    Gupta, Abhinav
    Belongie, Serge
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6575 - 6583
  • [3] Robust Subspace Clustering with Block Diagonal Representation for Noisy Image Datasets
    Li, Qiang
    Xie, Ziqi
    Wang, Lihong
    ELECTRONICS, 2023, 12 (05)
  • [4] Large Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning
    He, Yonghao
    Wang, Jian
    Kang, Cuicui
    Xiang, Shiming
    Pan, Chunhong
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 523 - 526
  • [5] Product image recognition with guidance learning and noisy supervision
    Li, Qing
    Peng, Xiaojiang
    Cao, Liangliang
    Du, Wenbin
    Xing, Hao
    Qiao, Yu
    Peng, Qiang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 196
  • [6] DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps
    Bertucci D.
    Hamid M.M.
    Anand Y.
    Ruangrotsakun A.
    Tabatabai D.
    Perez M.
    Kahng M.
    IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 320 - 330
  • [7] On-the-fly learning for visual search of large-scale image and video datasets
    Chatfield, Ken
    Arandjelovic, Relja
    Parkhi, Omkar
    Zisserman, Andrew
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2015, 4 (02) : 75 - 93
  • [8] Learning to Index in Large-Scale Datasets
    Prayoonwong, Amorntip
    Wang, Cheng-Hsien
    Chiu, Chih-Yi
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 305 - 316
  • [9] Multiscale Markov random fields for large image datasets representation
    Rehrauer, H
    Seidel, K
    Datcu, M
    IGARSS '97 - 1997 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, PROCEEDINGS VOLS I-IV: REMOTE SENSING - A SCIENTIFIC VISION FOR SUSTAINABLE DEVELOPMENT, 1997, : 255 - 257
  • [10] Large-Scale Taxonomy Categorization for Noisy Product Listings
    Das, Pradipto
    Xia, Yandi
    Levine, Aaron
    Di Fabbrizio, Giuseppe
    Datta, Ankur
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3885 - 3894