Multiclass Laplacian support vector machine with functional analysis of variance decomposition

被引:0
作者
Park, Beomjin [1 ,2 ]
Park, Changyi [3 ]
机构
[1] Gyeongsang Natl Univ, Dept Informat & Stat, Jinju 52828, South Korea
[2] Gyeongsang Natl Univ, Dept Bio & Med Big Data, Plus Program BK21, Jinju 52828, South Korea
[3] Univ Seoul, Dept Stat, Seoul 02504, South Korea
基金
新加坡国家研究基金会;
关键词
Laplacian support vector machine; Multiclass classification; Semi-supervised learning; Variable selection; GENE SELECTION; CLASSIFICATION;
D O I
10.1016/j.csda.2023.107814
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In classification problems, acquiring a sufficient amount of labeled samples sometimes proves expensive and time-consuming, while unlabeled samples are relatively easier to obtain. The Laplacian Support Vector Machine (LapSVM) is one of the successful methods that learn better classification functions by incorporating unlabeled samples. However, since LapSVM was originally designed for binary classification, it can not be applied directly to multiclass classification problems commonly encountered in practice. Thus we derive an extension of LapSVM to multiclass classification problems using an appropriate multiclass formulation. Another problem with LapSVM is that irrelevant variables easily degrade classification performance. The irrelevant variables can increase the variance of predicted values and make the model difficult to interpret. Therefore, this paper also proposes the multiclass LapSVM with functional analysis of variance decomposition to identify relevant variables. Through comprehensive simulations and real-world datasets, we demonstrate the efficiency and improved classification performance of the proposed methods.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:21
相关论文
共 44 条
[1]  
[Anonymous], 2007, Artificial intelligence and statistics
[2]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[3]  
Bennett KP, 1999, ADV NEUR IN, V11, P368
[4]  
Biau G, 2016, TEST-SPAIN, V25, P197, DOI 10.1007/s11749-016-0481-7
[5]  
Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
[6]   Multicategory classification by support vector machines [J].
Bredensteiner, EJ ;
Bennett, KP .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 12 (1-3) :53-79
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[9]  
Chapelle O, 2008, J MACH LEARN RES, V9, P203
[10]  
Collobert R, 2006, J MACH LEARN RES, V7, P1687