Large scale multi-class classification with truncated nuclear norm regularization

Cited by: 9

Authors:

Hu, Yao [1]

Jin, Zhongming [1]

Shi, Yi [1]

Zhang, Debing [1]

Cai, Deng [1]

He, Xiaofei [1]

Affiliations:

[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China

Source:

NEUROCOMPUTING | 2015 / Volume 148

Funding:

National Natural Science Foundation of China;

Keywords:

Truncated nuclear norm; Coordinate descent algorithm; Multi-class classification;

DOI:

10.1016/j.neucom.2014.06.073

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we consider the problem of multi-class image classification when the behaviour of the classes has a low-rank structure, that is, when the classes can be embedded into a low-dimensional space. Traditional multi-class classification algorithms usually use the nuclear norm to approximate the rank of the weight matrix. Because the nuclear norm approximates the rank with limited accuracy, we propose a new scalable large-scale multi-class classification algorithm that uses the recently proposed truncated nuclear norm as a better surrogate of the matrix rank operator, together with the multinomial logistic loss. To solve the resulting non-convex and non-smooth optimization problem, we further develop an efficient iterative procedure. In each iteration, the non-smooth convex subproblem is lifted into an infinite-dimensional ℓ1-norm regularized problem, and a simple and efficient accelerated coordinate descent algorithm is applied to find the optimal solution. We conduct a series of evaluations on several public large-scale image datasets; the experimental results show an encouraging improvement in classification accuracy for the proposed algorithm over state-of-the-art multi-class classification algorithms. © 2014 Elsevier B.V. All rights reserved.
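The abstract names its two ingredients without defining them, so the following is only a minimal sketch of what they could look like, not the authors' implementation. It assumes the commonly used definition of the truncated nuclear norm (the sum of all singular values except the `r` largest) and a plain softmax cross-entropy for the multinomial logistic loss. The function names, the truncation parameter `r`, and the `(features x classes)` layout of `W` are illustrative assumptions, and the accelerated coordinate descent solver is not reproduced here.

```python
import numpy as np


def nuclear_norm(W):
    """Sum of all singular values of W."""
    return float(np.linalg.svd(W, compute_uv=False).sum())


def truncated_nuclear_norm(W, r):
    """Sum of the singular values of W, excluding the r largest ones.

    Assumed definition: for a matrix of rank at most r this value is
    (numerically) zero, whereas the nuclear norm still grows with the
    magnitude of the leading singular values.
    """
    s = np.linalg.svd(W, compute_uv=False)  # returned in descending order
    if not 0 <= r <= s.size:
        raise ValueError("r must lie between 0 and min(W.shape)")
    return float(s[r:].sum())


def multinomial_logistic_loss(W, X, y):
    """Mean softmax cross-entropy for weights W (d x k), data X (n x d), labels y (n,)."""
    scores = X @ W
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(y)), y].mean())


def objective(W, X, y, r, lam):
    """Hypothetical regularized objective: loss + lam * truncated nuclear norm."""
    return multinomial_logistic_loss(W, X, y) + lam * truncated_nuclear_norm(W, r)
```

For a weight matrix of exact rank 2, `truncated_nuclear_norm(W, 2)` is zero up to rounding while `nuclear_norm(W)` is not, which illustrates why the truncated variant can act as a tighter surrogate for the rank.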


Pages: 310-317

Page count: 8
