Constrained subspace classifier for high dimensional datasets

被引:8
作者
Panagopoulos, Orestis P. [1 ]
Pappu, Vijay [2 ]
Xanthopoulos, Petros [1 ]
Pardalos, Panos M. [2 ]
机构
[1] Univ Cent Florida, Dept Ind Engn & Management Syst, Orlando, FL 32816 USA
[2] Univ Florida, Dept Ind Engn, Gainesville, FL 32608 USA
来源
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE | 2016年 / 59卷
关键词
Constrained subspace classifier; High dimensional datasets; Principal angles; Local subspace classifier; FEATURE-SELECTION; SETS;
D O I
10.1016/j.omega.2015.05.009
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Datasets with significantly larger number of features, compared to samples, pose a serious challenge in supervised learning. Such datasets arise in various areas including business analytics. In this paper, a new binary classification method called constrained subspace classifier (CSC) is proposed for such high dimensional datasets. CSC improves on an earlier proposed classification method called local subspace classifier (LSC) by accounting for the relative angle between subspaces while approximating the classes with individual subspaces. CSC is formulated as an optimization problem and can be solved by an efficient alternating optimization technique. Classification performance is tested in publicly available datasets. The improvement in classification accuracy over LSC shows the importance of considering the relative angle between the subspaces while approximating the classes. Additionally, CSC appears to be a robust classifier, compared to traditional two step methods that perform feature selection and classification in two distinct steps. (c) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:40 / 46
页数:7
相关论文
共 47 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 2002, Principal components analysis
[3]  
[Anonymous], 2012, MATRIX COMPUTATIONS
[4]  
[Anonymous], 2000, 5 ONL WORLD C SOFT C, DOI [10.1097/ALN.0000000000002350, DOI 10.1200/JCO.2010.30]
[5]  
[Anonymous], 1997, The econometrics of financial markets, DOI DOI 10.1515/9781400830213
[6]  
[Anonymous], 1998, FEATURE EXTRACTION C
[7]   High-Dimensional Methods and Inference on Structural and Treatment Effects [J].
Belloni, Alexandre ;
Chernozhukov, Victor ;
Hansen, Christian .
JOURNAL OF ECONOMIC PERSPECTIVES, 2014, 28 (02) :29-50
[8]  
Bennett J., 2007, PROC KDD CUP WORK SH, P6
[9]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[10]   NUMERICAL METHODS FOR COMPUTING ANGLES BETWEEN LINEAR SUBSPACES [J].
BJORCK, A ;
GOLUB, GH .
MATHEMATICS OF COMPUTATION, 1973, 27 (123) :579-594