Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop

被引:149
|
作者
Cui, Yin [1 ,2 ]
Zhou, Feng [3 ]
Lin, Yuanqing [3 ]
Belongie, Serge [1 ,2 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
[2] Cornell Tech, New York, NY 10011 USA
[3] NEC Labs Amer, Princeton, NJ USA
关键词
D O I
10.1109/CVPR.2016.130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing fine-grained visual categorization methods often suffer from three challenges: lack of training data, large number of fine-grained categories, and high intraclass vs. low inter-class variance. In this work we propose a generic iterative framework for fine-grained categorization and dataset bootstrapping that handles these three challenges. Using deep metric learning with humans in the loop, we learn a low dimensional feature embedding with anchor points on manifolds for each category. These anchor points capture intra-class variances and remain discriminative between classes. In each round, images with high confidence scores from our model are sent to humans for labeling. By comparing with exemplar images, labelers mark each candidate image as either a "true positive" or a "false positive." True positives are added into our current dataset and false positives are regarded as "hard negatives" for our metric learning model. Then the model is retrained with an expanded dataset and hard negatives for the next round. To demonstrate the effectiveness of the proposed framework, we bootstrap a fine-grained flower dataset with 620 categories from Instagram images. The proposed deep metric learning scheme is evaluated on both our dataset and the CUB-200-2001 Birds dataset. Experimental evaluations show significant performance gain using dataset bootstrapping and demonstrate state-of-the-art results achieved by the proposed deep metric learning methods.
引用
收藏
页码:1153 / 1162
页数:10
相关论文
共 50 条
  • [1] Feathers Dataset for Fine-Grained Visual Categorization
    Belko, Alina
    Dobratulin, Konstantin
    Kuznetsov, Andrey
    THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [2] Fine-grained Patient Similarity Measuring using Deep Metric Learning
    Ni, Jiazhi
    Liu, Jie
    Zhang, Chenxin
    Ye, Dan
    Ma, Zhirou
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1189 - 1198
  • [3] A survey of fine-grained visual categorization based on deep learning
    XIE Yuxiang
    GONG Quanzhi
    LUAN Xidao
    YAN Jie
    ZHANG Jiahui
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1337 - 1356
  • [4] A Survey of Fine-Grained Visual Categorization Based on Deep Learning
    Xie, Yuxiang
    Gong, Quanzhi
    Luan, Xidao
    Yan, Jie
    Zhang, Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1337 - 1356
  • [5] A survey of fine-grained visual categorization based on deep learning
    Xie Yuxiang
    Gong Quanzhi
    Luan Xidao
    Yan Jie
    Zhang Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023,
  • [6] Improve Fine-Grained Feature Learning in Fine-Grained DataSet GAI
    Wang, Hai Peng
    Geng, Zhi Qing
    IEEE ACCESS, 2025, 13 : 12777 - 12788
  • [7] Fine-Grained Visual Categorization via Multi-stage Metric Learning
    Qian, Qi
    Jin, Rong
    Zhu, Shenghuo
    Lin, Yuanqing
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3716 - 3724
  • [8] StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization
    He, Xiangteng
    Peng, Yuxin
    Zhao, Junjie
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 741 - 747
  • [9] Hierarchical deep transfer learning for fine-grained categorization on micro datasets
    Wang, Ronggui
    Yao, Xuchen
    Yang, Juan
    Xue, Lixia
    Hu, Min
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 129 - 139
  • [10] Fine-Grained Categorization Using a Mixture of Transfer Learning Networks
    Firsching, Justin
    Hashem, Sherif
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 151 - 158