Adaptive Margin Diversity Regularizer for Handling Data Imbalance in Zero-Shot SBIR

被引:15
作者
Dutta, Titir [1 ]
Singh, Anurag [1 ]
Biswas, Soma [1 ]
机构
[1] Indian Inst Sci, Bangalore, India
来源
COMPUTER VISION - ECCV 2020, PT V | 2020年 / 12350卷
关键词
D O I
10.1007/978-3-030-58558-7_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data from new categories are continuously being discovered, which has sparked significant amount of research in developing approaches which generalize to previously unseen categories, i.e. zeroshot setting. Zero-shot sketch-based image retrieval (ZS-SBIR) is one such problem in the context of cross-domain retrieval, which has received lot of attention due to its various real-life applications. Since most real-world training data have a fair amount of imbalance; in this work, for the first time in literature, we extensively study the effect of training data imbalance on the generalization to unseen categories, with ZS-SBIR as the application area. We evaluate several state-of-the-art data imbalance mitigating techniques and analyze their results. Furthermore, we propose a novel framework AMDReg (Adaptive Margin Diversity Regularizer), which ensures that the embeddings of the sketches and images in the latent space are not only semantically meaningful, but they are also separated according to their class-representations in the training set. The proposed approach is model-independent, and it can be incorporated seamlessly with several state-of-the-art ZS-SBIR methods to improve their performance under imbalanced condition. Extensive experiments and analysis justify the effectiveness of the proposed AMDReg for mitigating the effect of data imbalance for generalization to unseen classes in ZS-SBIR.
引用
收藏
页码:349 / 364
页数:16
相关论文
共 37 条
[1]  
Barandela R, 2003, LECT NOTES COMPUT SC, V2905, P424
[2]  
Cao KD, 2019, ADV NEUR IN, V32
[3]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[4]   Class-Balanced Loss Based on Effective Number of Samples [J].
Cui, Yin ;
Jia, Menglin ;
Lin, Tsung-Yi ;
Song, Yang ;
Belongie, Serge .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269
[5]   Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval [J].
Dey, Sounak ;
Riba, Pau ;
Dutta, Anjan ;
Llados, Josep ;
Song, Yi-Zhe .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2174-2183
[6]   Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval [J].
Dutta, Anjan ;
Akata, Zeynep .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5084-5093
[7]  
Dutta T., 2019, BMVC
[8]   How Do Humans Sketch Objects? [J].
Eitz, Mathias ;
Hays, James ;
Alexa, Marc .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)
[9]   Multi-modal Cycle-Consistent Generalized Zero-Shot Learning [J].
Felix, Rafael ;
Kumar, B. G. Vijay ;
Reid, Ian ;
Carneiro, Gustavo .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :21-37
[10]  
Frome A., 2013, Advances in neural information processing systems, V26, P2121