Good Practice in Large-Scale Learning for Image Classification

被引:104
作者
Akata, Zeynep [1 ]
Perronnin, Florent [1 ,2 ]
Harchaoui, Zaid [2 ]
Schmid, Cordelia [2 ]
机构
[1] Xerox Res Ctr Europe, F-38240 Meylan, Isere, France
[2] INRIA Grenoble Rhone Alpes, F-38330 Montbonnot St Martin, Isere, France
关键词
Large scale; fine-grained visual categorization; image classification; ranking; SVM; stochastic learning; CLASSIFIERS;
D O I
10.1109/TPAMI.2013.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We benchmark several SVM objective functions for large-scale image classification. We consider one-versus-rest, multiclass, ranking, and weighted approximate ranking SVMs. A comparison of online and batch methods for optimizing the objectives shows that online methods perform as well as batch methods in terms of classification accuracy, but with a significant gain in training speed. Using stochastic gradient descent, we can scale the training to millions of images and thousands of classes. Our experimental evaluation shows that ranking-based algorithms do not outperform the one-versus-rest strategy when a large number of training examples are used. Furthermore, the gap in accuracy between the different algorithms shrinks as the dimension of the features increases. We also show that learning through cross-validation the optimal rebalancing of positive and negative examples can result in a significant improvement for the one-versus-rest strategy. Finally, early stopping can be used as an effective regularization strategy when training with online algorithms. Following these "good practices," we were able to improve the state of the art on a large subset of 10K classes and 9M images of ImageNet from 16.7 percent Top-1 accuracy to 19.1 percent.
引用
收藏
页码:507 / 520
页数:14
相关论文
共 89 条
  • [1] Allwein E.L., 2000, P INT C MACH LEARN I
  • [2] [Anonymous], P C NEUR INF PROC SY
  • [3] [Anonymous], 2010, P IEEE C COMP VIS PA
  • [4] [Anonymous], P IEEE INT C COMP VI
  • [5] [Anonymous], P IEEE C COMP VIS PA
  • [6] [Anonymous], P IEEE INT C COMP VI
  • [7] [Anonymous], 2007, P IEEE C COMP VIS PA
  • [8] [Anonymous], P IEEE C COMP VIS PA
  • [9] [Anonymous], 2004, P ECCV WORKSH STAT L
  • [10] [Anonymous], 2002, P ACM SIGKDD KDD 200, DOI 10.1145/775047.775067