Variable screening based on Gaussian Centered L-moments

被引:1
作者
An, Hyowon [1 ,3 ]
Zhang, Kai [1 ]
Oja, Hannu [2 ]
Marron, J. S. [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC USA
[2] Univ Turku, Turku 20500, Finland
[3] 615 Pavonia Ave,APT 1110, Jersey City, NJ 07306 USA
关键词
Robust statistics; L-moments; L-statistics; Skewness; Kurtosis; TESTS; NORMALITY; SKEWNESS; KURTOSIS;
D O I
10.1016/j.csda.2022.107632
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
An important challenge in big data is identification of important variables. For this purpose, methods of discovering variables with non-standard univariate marginal distributions are proposed. The conventional moments based summary statistics can be well-adopted, but their sensitivity to outliers can lead to selection based on a few outliers rather than distributional shape such as bimodality. To address this type of non-robustness, the L -moments are considered. Using these in practice, however, has a limitation since they do not take zero values at the Gaussian distributions to which the shape of a marginal distribution is most naturally compared. As a remedy, Gaussian Centered L-moments are proposed, which share advantages of the L-moments, but have zeros at the Gaussian distributions. The strength of Gaussian Centered L-moments over other conventional moments is shown in theoretical and practical aspects such as their performances in screening important genes in cancer genetics data.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 36 条
[1]  
An H., 2017, THESIS U N CAROLINA
[2]  
[Anonymous], WILEY SERIES PROBABI
[3]  
[Anonymous], 2009, Wiley Series in Probability and Statistics, DOI DOI 10.1002/9780470434697.CH7
[4]  
BOWLEY A., 1920, Elements of Statistics, V4
[5]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[6]   Normal scores, normal plots, and tests for normality [J].
Brown, BM ;
Hettmansperger, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1668-1675
[7]   A robust measure of skewness [J].
Brys, G ;
Hubert, M ;
Struyf, A .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2004, 13 (04) :996-1017
[8]   Robust measures of tail weight [J].
Brys, G ;
Hubert, M ;
Struyf, A .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (03) :733-759
[9]   STATISTICAL TREATMENT OF CENSORED DATA .1. FUNDAMENTAL FORMULAE [J].
DAVID, FN ;
JOHNSON, NL .
BIOMETRIKA, 1954, 41 (1-2) :228-240
[10]  
Decurninge A, 2015, Arxiv, DOI arXiv:1409.6013