Large scale multi-class classification with truncated nuclear norm regularization

被引:9
作者
Hu, Yao [1 ]
Jin, Zhongming [1 ]
Shi, Yi [1 ]
Zhang, Debing [1 ]
Cai, Deng [1 ]
He, Xiaofei [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Truncated nuclear norm; Coordinate descent algorithm; Multi-class classification;
D O I
10.1016/j.neucom.2014.06.073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the problem of multi-class image classification when the classes behaviour has a low rank structure. That is, classes can be embedded into a low dimensional space. Traditional multi-class classification algorithms usually use nuclear norm to approximate the rank of the weight matrix. Considering the limited ability of the nuclear norm for the accurate approximation, we propose a new scalable large scale multi-class classification algorithm by using the recently proposed truncated nuclear norm as a better surrogate of the rank operator of matrices along with multinomial logisitic loss. To solve the non-convex and non-smooth optimization problem, we further develop an efficient iterative procedure. In each iteration, by lifting the non-smooth convex subproblem into an infinite dimensional l(1) norm regularized problem, a simple and efficient accelerated coordinate descent algorithm is applied to find the optimal solution; We conduct a series of evaluations on several public large scale image datasets, where the experimental results show the encouraging improvement of classification accuracy of the proposed algorithm in comparison with the state-of-the-art multi-class classification algorithms. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:310 / 317
页数:8
相关论文
共 32 条
  • [1] Amit Y., 2007, P INT C MACH LEARN
  • [2] [Anonymous], 2011, COMPUTER VISION PATT
  • [3] [Anonymous], 1998, Using SeDuMi 1.02
  • [4] [Anonymous], P IEEE INT C COMP VI
  • [5] [Anonymous], 2014, Propack-software for large and sparse svd calculations
  • [6] Convexity, classification, and risk bounds
    Bartlett, PL
    Jordan, MI
    McAuliffe, JD
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 138 - 156
  • [7] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
    Beck, Amir
    Teboulle, Marc
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01): : 183 - 202
  • [8] Boyd S., 2011, FOUND TRENDS MACH LE, V3, P1, DOI DOI 10.1561/2200000016
  • [9] Chen K, 2005, Matrix Preconditioning Techniques and Applications, V19
  • [10] On the algorithmic implementation of multiclass kernel-based vector machines
    Crammer, K
    Singer, Y
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) : 265 - 292