Simultaneous classification and feature selection via convex quadratic programming with application to HIV-associated neurocognitive disorder assessment

被引:26
作者
Dunbar, Michelle [1 ]
Murray, John M. [2 ,3 ]
Cysique, Lucette A. [4 ]
Brew, Bruce J. [5 ]
Jeyakumar, Vaithilingam [1 ]
机构
[1] Univ New S Wales, Dept Appl Math, Sydney, NSW, Australia
[2] Univ New S Wales, Sch Math & Stat, Sydney, NSW, Australia
[3] Univ New S Wales, Natl Ctr HIV Epidemiol & Clin Res, Sydney, NSW, Australia
[4] Brain Sci Univ New S Wales, Sydney, NSW, Australia
[5] St Vincents Hosp, Dept Neurol, Sydney, NSW 2010, Australia
关键词
Quadratic optimization; Support vector machines; Classification; Feature selection; Nonnegativity constraints; HIV; Neurocognitive disorder; SUPPORT VECTOR MACHINES; GENE SELECTION; NEWTON METHOD; CANCER;
D O I
10.1016/j.ejor.2010.03.017
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Support vector machines (SVMs), that utilize a mixture of the L-1-norm and the L-2-norm penalties, are capable of performing simultaneous classification and selection of highly correlated features. These SVMs, typically set up as convex programming problems, are re-formulated here as simple convex quadratic minimization problems over non-negativity constraints, giving rise to a new formulation the pq-SVM method. Solutions to our re-formulation are obtained efficiently by an extremely simple algorithm. Computational results on a range of publicly available datasets indicate that these methods allow greater classification accuracy in addition to selecting groups of highly correlated features. These methods were also compared on a new dataset assessing HIV-associated neurocognitive disorder in a group of 97 HIV-infected individuals. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:470 / 478
页数:9
相关论文
共 23 条
[1]  
[Anonymous], 2007, UCI Machine Learning Repository
[2]  
BRADLEY PS, 1998, P 15 INT C ICML98 MA
[3]  
Brew BJ., 2008, J NEUROIMMUNE PHARM
[4]   Finite Newton method for Lagrangian support vector machine classification [J].
Fung, G ;
Mangasarian, OL .
NEUROCOMPUTING, 2003, 55 (1-2) :39-55
[5]  
Fung G., 2000, P 6 ACM SIGKDD INT C
[6]   A feature selection Newton method for support vector machine classification [J].
Fung, GM ;
Mangasarian, OL .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2004, 28 (02) :185-202
[7]  
Golub G. H., 1996, MATRIX COMPUTATIONS
[8]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[9]  
GUNN SR, 2007, MATLAB SVM TOOLBOX
[10]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422