Network-Based Discriminant Analysis for Multiclassification

被引:7
作者
Chen, Li-Pang [1 ]
机构
[1] Natl Chengchi Univ, Dept Stat, Taipei 116, Taiwan
关键词
F-score; Gaussian graphical models; Discriminant function; Multiclassification; Network structure; Precision matrix; Prediction; VARIABLE SELECTION; GRAPHICAL MODELS; CLASSIFICATION;
D O I
10.1007/s00357-022-09414-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Classification for multi-label responses, known as multiclassification, has been an important problem in supervised learning and has attracted our attention. In the framework of statistical learning, discriminant analysis is a powerful method to do multiclassification. With the increasing availability of complex data, it becomes more challenging to analyze them. One of the important features in complex data is the network structure, which is ubiquitous in high-dimensional data because of strong or weak correlations among variables. Although discriminant analysis is one of the supervised learning methods to deal with multiclassification and relevant extensions have been explored, little method has been available to handle multiclassification with network structures accommodated. To incorporate network structures in predictors and improve the accuracy of classification, we propose network-based linear discriminant analysis and network-based quadratic discriminant analysis in this paper. The main advantage of the proposed methods is to estimate the inverse of covariance matrices directly and do classification for multi-label responses instead of restricting on binary responses. In addition, the proposed methods are easy to compute and implement. Finally, numerical studies are conducted to assess the performance of the proposed methods, and numerical results verify that the proposed methods outperform their competitors.
引用
收藏
页码:410 / 431
页数:22
相关论文
共 34 条
[1]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [10.1007/978-1-4612-0919-5_38, DOI 10.1007/978-1-4612-0919-5_38]
[2]   New algorithms for multi-class cancer diagnosis using tumor gene expression signatures [J].
Bagirov, AM ;
Ferguson, B ;
Ivkovic, S ;
Saunders, G ;
Yearwood, J .
BIOINFORMATICS, 2003, 19 (14) :1800-1807
[3]   BAYESIAN SPARSE GRAPHICAL MODELS FOR CLASSIFICATION WITH APPLICATION TO PROTEIN EXPRESSION DATA [J].
Baladandayuthapani, Veerabhadran ;
Talluri, Rajesh ;
Ji, Yuan ;
Coombes, Kevin R. ;
Lu, Yiling ;
Hennessy, Bryan T. ;
Davies, Michael A. ;
Mallick, Bani K. .
ANNALS OF APPLIED STATISTICS, 2014, 8 (03) :1443-1468
[4]   PCA disjoint models for multiclass cancer analysis using gene expression data [J].
Bicciato, S ;
Luchini, A ;
Di Bello, C .
BIOINFORMATICS, 2003, 19 (05) :571-578
[5]   Multi-dimensional classification with Bayesian networks [J].
Bielza, C. ;
Li, G. ;
Larranaga, P. .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (06) :705-727
[6]   Network linear discriminant analysis [J].
Cai, Wei ;
Guan, Guoyu ;
Pan, Rui ;
Zhu, Xuening ;
Wang, Hansheng .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 117 :32-44
[7]   EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM [J].
Chen, Jiahua ;
Chen, Zehua .
STATISTICA SINICA, 2012, 22 (02) :555-574
[8]  
Chen L.-P., 2019, UWSPACE
[9]  
Chen L-P., 2018, BIOSTATISTICS BIOMET, V9, DOI [10.19080/BBOAJ.2018.09.555751, DOI 10.19080/BBOAJ.2018.09.555751]
[10]  
Chen LP, 2019, Journal of Statistical Distributions and Applications, V6, DOI [10.1186/s40488-019-0094-2, 10.1186/s40488-019-0094-2, DOI 10.1186/S40488-019-0094-2]