A LARGE SCALE ANALYSIS OF LOGISTIC REGRESSION: ASYMPTOTIC PERFORMANCE AND NEW INSIGHTS

被引:0
|
作者
Mai, Xiaoyi [1 ,2 ]
Liao, Zhenyu [1 ,2 ]
Couillet, Romain [1 ,2 ]
机构
[1] Univ Paris Saclay, Cent Supelec, St Aubin, France
[2] Univ Grenoble Alpes, GIPSA Lab, Grenoble, France
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
High dimensional statistic; logistic regression; machine learning; random matrix theory;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Logistic regression, one of the most popular machine learning binary classification methods, has been long believed to be unbiased. In this paper, we consider the "hard" classification problem of separating high dimensional Gaussian vectors, where the data dimension p and the sample size n are both large. Based on recent advances in random matrix theory (RMT) and high dimensional statistics, we evaluate the asymptotic distribution of the logistic regression classifier and consequently, provide the associated classification performance. This brings new insights into the internal mechanism of logistic regression classifier, including a possible bias in the separating hyperplane, as well as on practical issues such as hyper-parameter tuning, thereby opening the door to novel RMT-inspired improvements.
引用
收藏
页码:3357 / 3361
页数:5
相关论文
共 50 条
  • [31] PAIRWISE INTERACTION ANALYSIS OF LOGISTIC REGRESSION MODELS
    Xu, Easton Li
    Qian, Xiaoning
    Liu, Tie
    Cui, Shuguang
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 187 - 191
  • [32] Logistic regression and CART in the analysis of multimarker studies
    Muller, Reinhold
    Moeckel, Martin
    CLINICA CHIMICA ACTA, 2008, 394 (1-2) : 1 - 6
  • [33] Logistic Regression Model Optimization and Case Analysis
    Zou, Xiaonan
    Hu, Yong
    Tian, Zhewen
    Shen, Kaiyuan
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 135 - 139
  • [34] Logistic regression analysis on the determinants of stillbirth in Ethiopia
    Kidanemariam Alem Berhie
    Habtamu Gebremariam Gebresilassie
    Maternal Health, Neonatology and Perinatology, 2 (1)
  • [35] Analysis of the posterior for spline estimators in logistic regression
    Raghavan, N
    Cox, DD
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 71 (1-2) : 117 - 136
  • [36] Logistic Regression Revisited: Belief Function Analysis
    Denoeux, Thierry
    BELIEF FUNCTIONS: THEORY AND APPLICATIONS, BELIEF 2018, 2018, 11069 : 57 - 64
  • [37] Logistic regression analysis of customer satisfaction data
    Lawson, Cathy
    Montgomery, Douglas C.
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2006, 22 (08) : 971 - 984
  • [38] University Information System's Impact on Academic Performance: A Comprehensive Logistic Regression Analysis with Principal Component Analysis and Performance Metrics
    Selim, Aybeyan
    Ali, Ilker
    Ristevski, Blagoj
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2024, 13 (02): : 1589 - 1598
  • [39] Extracting New Words with Mutual Information and Logistic Regression
    Chen X.
    Han C.
    An Y.
    Liu L.
    Li Z.
    Yang R.
    Data Analysis and Knowledge Discovery, 2019, 3 (08) : 105 - 113
  • [40] Some new methods to solve multicollinearity in logistic regression
    Asar, Yasin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (04) : 2576 - 2586