Compositional data;
Regression;
alpha-Transformation;
k-NN algorithm;
Kernel regression;
STATISTICAL-ANALYSIS;
DOI:
10.1007/s11222-023-10277-5
Chinese Library Classification number:
TP301 [Theory, methods];
Discipline classification code:
081202;
Abstract:
Compositional data arise in many real-life applications, and versatile methods for properly analyzing this type of data in the regression context are needed. When parametric assumptions do not hold or are difficult to verify, non-parametric regression models can provide a convenient alternative method for prediction. To this end, we consider an extension to the classical k-Nearest Neighbours (k-NN) regression that yields a highly flexible non-parametric regression model for compositional data. A similar extension of kernel regression is proposed by adopting the Nadaraya-Watson estimator. Both extensions involve a power transformation termed the alpha-transformation. Unlike many of the recommended regression models for compositional data, zero values (which commonly occur in practice) are not problematic and can be incorporated into the proposed models without modification. Extensive simulation studies and real-life data analyses highlight the advantage of using these non-parametric regressions for complex relationships between compositional response data and Euclidean predictor variables. Both the extended k-NN and kernel regressions can lead to more accurate predictions than current regression models, which assume a sometimes restrictive parametric relationship with the predictor variables. In addition, the extended k-NN regression, in contrast to current regression techniques, enjoys high computational efficiency, rendering it highly attractive for use with large data sets.
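The abstract gives no formulas, so the following is only a minimal sketch of how a k-NN regression with a compositional response and a power transformation might look. It assumes the alpha-transformation starts from the closed power transform x^alpha / sum(x^alpha), and that the prediction is a power-type mean of the responses of the k nearest neighbours in Euclidean predictor space. The function names (`power_closure`, `alpha_mean`, `alpha_knn_predict`) and the default values of `k` and `alpha` are hypothetical and are not taken from this record.

```python
import numpy as np

def power_closure(y, alpha):
    """Raise each composition (row) to the power alpha and re-close it to sum to 1.
    Assumed form of the power step; zeros stay zero when alpha > 0."""
    p = np.power(y, alpha)
    return p / p.sum(axis=1, keepdims=True)

def alpha_mean(y, alpha):
    """Assumed power-type mean of the rows of y: average the power-closed
    compositions, invert the power, and close again."""
    m = power_closure(y, alpha).mean(axis=0)
    m = np.power(m, 1.0 / alpha)
    return m / m.sum()

def alpha_knn_predict(x_train, y_train, x_new, k=5, alpha=0.5):
    """Predict a composition for each row of x_new from its k nearest
    training points (Euclidean distance on the predictors)."""
    # Pairwise distances, shape (n_new, n_train).
    d = np.linalg.norm(x_train[None, :, :] - x_new[:, None, :], axis=2)
    # Indices of the k closest training points for each new point.
    idx = np.argsort(d, axis=1)[:, :k]
    return np.vstack([alpha_mean(y_train[i], alpha) for i in idx])

# Toy data: three training points, responses with zero components.
x_train = np.array([[0.0], [1.0], [10.0]])
y_train = np.array([[0.5, 0.5, 0.0],
                    [0.5, 0.5, 0.0],
                    [0.0, 0.0, 1.0]])
pred = alpha_knn_predict(x_train, y_train, np.array([[0.4]]), k=2, alpha=0.5)
```

With alpha = 1 this sketch reduces to the ordinary arithmetic mean of the neighbours' compositions; smaller positive values of alpha change how the components are weighted while still accepting zero components without any replacement step.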
Pages: 17