Robust Covariance Matrix Estimation for High-Dimensional Compositional Data with Application to Sales Data Analysis

被引:6
作者
Li, Danning [1 ,2 ]
Srinivasan, Arun [3 ]
Chen, Qian [4 ]
Xue, Lingzhou [3 ]
机构
[1] Northeast Normal Univ, Sch Math & Stat, Changchun, Peoples R China
[2] Northeast Normal Univ, KLAS, Changchun, Peoples R China
[3] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[4] Penn State Univ, Dept Supply Chain & Informat Syst, University Pk, PA 16802 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Cross-validation; Huber's M-estimator; Robustness; Thresholding; REGRESSION;
D O I
10.1080/07350015.2022.2106990
中图分类号
F [经济];
学科分类号
02 ;
摘要
Compositional data arises in a wide variety of research areas when some form of standardization and composition is necessary. Estimating covariance matrices is of fundamental importance for high-dimensional compositional data analysis. However, existing methods require the restrictive Gaussian or sub-Gaussian assumption, which may not hold in practice. We propose a robust composition adjusted thresholding covariance procedure based on Huber-type M-estimation to estimate the sparse covariance structure of high-dimensional compositional data. We introduce a cross-validation procedure to choose the tuning parameters of the proposed method. Theoretically, by assuming a bounded fourth moment condition, we obtain the rates of convergence and signal recovery property for the proposed method and provide the theoretical guarantees for the cross-validation procedure under the high-dimensional setting. Numerically, we demonstrate the effectiveness of the proposed method in simulation studies and also a real application to sales data analysis.
引用
收藏
页码:1090 / 1100
页数:11
相关论文
共 27 条
[1]  
AITCHISON J, 1984, BIOMETRIKA, V71, P323
[2]  
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[3]  
AITCHISON J, 1983, BIOMETRIKA, V70, P57
[4]  
ANDREWS DF, 1974, J ROY STAT SOC B MET, V36, P99
[5]  
Arata Y., 2017, Discussion Papers
[6]   Robust estimation of high-dimensional covariance and precision matrices [J].
Avella-Medina, Marco ;
Battey, Heather S. ;
Fan, Jianqing ;
Li, Quefeng .
BIOMETRIKA, 2018, 105 (02) :271-284
[7]   COVARIANCE REGULARIZATION BY THRESHOLDING [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (06) :2577-2604
[8]   Adaptive Thresholding for Sparse Covariance Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :672-684
[9]   ON THE THEORY OF ELLIPTICALLY CONTOURED DISTRIBUTIONS [J].
CAMBANIS, S ;
HUANG, S ;
SIMONS, G .
JOURNAL OF MULTIVARIATE ANALYSIS, 1981, 11 (03) :368-385
[10]   Large Covariance Estimation for Compositional Data Via Composition-Adjusted Thresholding [J].
Cao, Yuanpei ;
Lin, Wei ;
Li, Hongzhe .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) :759-772