Rejoinder on: Compositional data: the sample space and its structure

被引:2
作者
Jose Egozcue, Juan [1 ]
Pawlowsky-Glahn, Vera [2 ]
机构
[1] Univ Politecn Cataluna, Jordi Girona 1-3,Mod C2, Barcelona, Spain
[2] Univ Girona, Campus Montilivi P4, Girona, Spain
关键词
Aitchison geometry; Biplot; Dendrogram; Equivalence class; Euclidean space; Household income; Isometric log-ratio coordinates; Logistic-normal; Normal distribution on the simplex; Principal balances; Principal components; Simplex;
D O I
10.1007/s11749-019-00674-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The log-ratio approach to compositional data (CoDa) analysis has now entered a mature phase. The principles and statistical tools introduced by J. Aitchison in the eighties have proven successful in solving a number of applied problems. The algebraic–geometric structure of the sample space, tailored to those principles, was developed at the beginning of the millennium. Two main ideas completed the J. Aitchison’s seminal work: the conception of compositions as equivalence classes of proportional vectors, and their representation in the simplex endowed with an interpretable Euclidean structure. These achievements allowed the representation of compositions in meaningful coordinates (preferably Cartesian), as well as orthogonal projections compatible with the Aitchison distance introduced two decades before. These ideas and concepts are reviewed up to the normal distribution on the simplex and the associated central limit theorem. Exploratory tools, specifically designed for CoDa, are also reviewed. To illustrate the adequacy and interpretability of the sample space structure, a new inequality index, based on the Aitchison norm, is proposed. Most concepts are illustrated with an example of mean household gross income per capita in Spain. © 2019, Sociedad de Estadística e Investigación Operativa.
引用
收藏
页码:658 / 663
页数:6
相关论文
共 13 条
[1]  
AITCHISON J, 1983, BIOMETRIKA, V70, P57
[2]   Biplots of compositional data [J].
Aitchison, J ;
Greenacre, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2002, 51 :375-392
[3]  
Aitchison J., 2003, Monographs on statistics and applied probability
[4]  
[Anonymous], STUDIES MATH GEOLOGY
[5]   The Mathematics of Compositional Analysis [J].
Barcelo-Vidal, Caries ;
Martin-Fernandez, Josep-Antoni .
AUSTRIAN JOURNAL OF STATISTICS, 2016, 45 (04) :57-71
[6]   Evidence functions: a compositional approach to information [J].
Egozcue, J. J. ;
Pawlowsky-Glahn, V .
SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2018, 42 (02) :101-124
[7]   Groups of parts and their balances in compositional data analysis [J].
Egozcue, JJ ;
Pawlowsky-Glahn, V .
MATHEMATICAL GEOLOGY, 2005, 37 (07) :795-828
[8]   Isometric logratio transformations for compositional data analysis [J].
Egozcue, JJ ;
Pawlowsky-Glahn, V ;
Mateu-Figueras, G ;
Barceló-Vidal, C .
MATHEMATICAL GEOLOGY, 2003, 35 (03) :279-300
[9]   Outlier detection for compositional data using robust methods [J].
Filzmoser, Peter ;
Hron, Karel .
MATHEMATICAL GEOSCIENCES, 2008, 40 (03) :233-248
[10]   Variable Selection in Compositional Data Analysis Using Pairwise Logratios [J].
Greenacre, Michael .
MATHEMATICAL GEOSCIENCES, 2019, 51 (05) :649-682