Fitting Penalized Logistic Regression Models Using QR Factorization

被引:1
作者
Klimaszewski, Jacek [1 ]
Korzen, Marcin [1 ]
机构
[1] West Pomeranian Univ Technol Szczecin, Fac Comp Sci & Informat Technol, Szczecin, Poland
来源
COMPUTATIONAL SCIENCE - ICCS 2020, PT II | 2020年 / 12138卷
关键词
Newton method; Logistic regression; Regularization; QR factorization; LIKELIHOOD;
D O I
10.1007/978-3-030-50417-5_4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper presents improvement of a commonly used learning algorithm for logistic regression. In the direct approach Newton method needs inversion of Hessian, what is cubic with respect to the number of attributes. We study a special case when the number of samples m is smaller than the number of attributes n, and we prove that using previously computed QR factorization of the data matrix, Hessian inversion in each step can be performed significantly faster, that is O (m(3)) or O (m(2)n) instead of O (n(3)) in the ordinary Newton optimization case. We show formally that it can be adopted very effectively to l(2) penalized logistic regression and also, not so effectively but still competitively, for certain types of sparse penalty terms. This approach can be especially interesting for a large number of attributes and relatively small number of samples, what takes place in the so-called extreme learning. We present a comparison of our approach with commonly used learning tools.
引用
收藏
页码:44 / 57
页数:14
相关论文
共 21 条
[1]  
[Anonymous], 2001, The elements of statistical learning, DOI DOI 10.1007/978-0-387-21606-5
[2]  
[Anonymous], 2003, Technical report
[3]  
[Anonymous], 2012, Matrix Computations
[4]  
Boyd S., 2004, Berichte Uber Verteilte Messysteme, P1, DOI DOI 10.1017/CBO9780511804441
[5]   On the nonmonotone line search [J].
Dai, YH .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2002, 112 (02) :315-330
[6]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[8]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[9]  
GREEN PJ, 1984, J ROY STAT SOC B, V46, P149
[10]  
Hastie T., 2003, Expression arrays and the p n problem