A new Gini correlation between quantitative and qualitative variables

被引:10
作者
Dang, Xin [1 ]
Nguyen, Dao [1 ]
Chen, Yixin [2 ]
Zhang, Junying [3 ]
机构
[1] Univ Mississippi, Dept Math, 315 Hume Hall, University, MS 38677 USA
[2] Univ Mississippi, Dept Comp & Informat Sci, University, MS 38677 USA
[3] Taiyuan Univ Technol, Dept Math, Taiyuan, Peoples R China
关键词
distance correlation; energy distance; Gini correlation; Gini mean difference; CORRELATION-COEFFICIENT; COVARIANCE; ENERGY;
D O I
10.1111/sjos.12490
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a new Gini correlation to measure dependence between a categorical and numerical variables. Analogous to PearsonR(2)in ANOVA model, the Gini correlation is interpreted as the ratio of the between-group variation and the total variation, but it characterizes independence (zero Gini correlation mutually implies independence). Closely related to the distance correlation, the Gini correlation is of simple formulation by considering the nature of categorical variable. As a result, the proposed Gini correlation has a simpler computation implementation than the distance correlation and is more straightforward to perform inference. Simulation and real data applications are conducted to demonstrate the advantages.
引用
收藏
页码:1314 / 1343
页数:30
相关论文
共 47 条
[1]  
[Anonymous], 2004, INTERSTAT
[2]   On a new multivariate two-sample test [J].
Baringhaus, L ;
Franz, C .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 88 (01) :190-206
[3]   On mutual information estimation for mixed-pair random variables [J].
Beknazaryan, Aleksandr ;
Dang, Xin ;
Sang, Hailin .
STATISTICS & PROBABILITY LETTERS, 2019, 148 :9-16
[4]  
Cramer H., 1946, Mathematical methods of statistics.
[5]   A distribution-free test of independence based on mean variance index [J].
Cui, Hengjian ;
Zhong, Wei .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 139 :117-133
[6]   Model-Free Feature Screening for Ultrahigh Dimenssional Discriminant Analysis [J].
Cui, Hengjian ;
Li, Runze ;
Zhong, Wei .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) :630-641
[7]   Gini covariance matrix and its affine equivariant version [J].
Dang, Xin ;
Sang, Hailin ;
Weatherall, Lauren .
STATISTICAL PAPERS, 2019, 60 (03) :291-316
[8]   GINIS MEAN DIFFERENCE REDISCOVERED [J].
DAVID, HA .
BIOMETRIKA, 1968, 55 (03) :573-&
[9]   FORMULA FOR THE GINI COEFFICIENT [J].
DORFMAN, R .
REVIEW OF ECONOMICS AND STATISTICS, 1979, 61 (01) :146-149
[10]  
Dua D, 2019, UCI MACHINE LEARNING