SIMPLEX-BASED MULTINOMIAL LOGISTIC REGRESSION WITH DIVERGING NUMBERS OF CATEGORIES AND COVARIATE

被引:5
作者
Fu, Sheng [1 ]
Chen, Piao [2 ]
Liu, Yufeng [3 ]
Ye, Zhisheng [1 ]
机构
[1] Natl Univ Singapore, Dept Ind Syst Engn & Management, Singapore 119077, Singapore
[2] Delft Univ Technol, Delft Inst Appl Math, NL-2628 XE Delft, Netherlands
[3] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Asymptotics; classification; Fisher consistency; kernel learning; MLR; simplex coding scheme; MAXIMUM-LIKELIHOOD-ESTIMATION; GENERALIZED LINEAR-MODELS; ASYMPTOTIC-BEHAVIOR; LOGIT-MODELS; CLASSIFICATION; PARAMETERS; CONSISTENCY; ESTIMATORS;
D O I
10.5705/ss.202021.0082
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multinomial logistic regression models are popular in multicategory classification analysis, but existing models suffer several intrinsic drawbacks. In particular, the parameters cannot be determined uniquely because of the over-specification. Although additional constraints have been imposed to refine the model, such modifications can be inefficient and complicated. In this paper, we propose a novel and efficient simplex-based multinomial logistic regression technique, seamlessly connecting binomial and multinomial cases under a unified framework. Compared with existing models, our model has fewer parameters, is free of any constraints, and can be solved efficiently using the Fisher scoring algorithm. In addition, the proposed model enjoys several theoretical advantages, including Fisher consistency and sharp comparison inequality. Under mild conditions, we establish the asymptotical normality and convergence for the new model, even when the numbers of categories and covariates increase with the sample size. The proposed framework is illustrated by means of extensive simulations and real applications.
引用
收藏
页码:2463 / 2493
页数:31
相关论文
共 61 条
[1]  
ALBERT A, 1984, BIOMETRIKA, V71, P1
[2]   SEPARATE SAMPLE LOGISTIC DISCRIMINATION [J].
ANDERSON, JA .
BIOMETRIKA, 1972, 59 (01) :19-35
[3]  
ANDERSON JA, 1982, BIOMETRIKA, V69, P123, DOI 10.1093/biomet/69.1.123
[4]  
[Anonymous], 2002, COLING 02 6 C NATURA
[5]  
[Anonymous], 2013, Softmax Regression
[6]  
[Anonymous], 2012, P 25 INT C NEUR INF
[7]   THE MULTINOMIAL-POISSON TRANSFORMATION [J].
BAKER, SG .
STATISTICIAN, 1994, 43 (04) :495-504
[8]   Convexity, classification, and risk bounds [J].
Bartlett, PL ;
Jordan, MI ;
McAuliffe, JD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :138-156
[9]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[10]   MULTINOMIAL LOGISTIC-REGRESSION ALGORITHM [J].
BOHNING, D .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1992, 44 (01) :197-200