A Weighted Edge-Count Two-Sample Test for Multivariate and Object Data

被引:35
作者
Chen, Hao [1 ]
Chen, Xu [2 ]
Su, Yi [1 ]
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
[2] Duke Univ, Dept Stat, Durham, NC USA
基金
美国国家科学基金会;
关键词
Nonparametric test; Permutation null distribution; Similarity graph; Unequal sample sizes; NETWORK;
D O I
10.1080/01621459.2017.1307757
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-sample tests for multivariate data and non-Euclidean data are widely used in many fields. Parametric tests are mostly restrained to certain types of data that meets the assumptions of the parametric models. In this article, we study a nonparametric testing procedure that uses graphs representing the similarity among observations. It can be applied to any data types as long as an informative similarity measure on the sample space can be defined. The classic test based on a similarity graph has a problem when the two sample sizes are different. We solve the problem by applying appropriate weights to different components of the classic test statistic. The new test exhibits substantial power gains in simulation studies. Its asymptotic permutation null distribution is derived and shown to work well under finite samples, facilitating its application to large datasets. The new test is illustrated through an analysis on a real dataset of network data.
引用
收藏
页码:1146 / 1155
页数:10
相关论文
共 14 条
[1]  
[Anonymous], 2020, Nonparametric Statistical Inference, DOI DOI 10.1201/9781439896129
[2]   A New Graph-Based Two-Sample Test for Multivariate and Object Data [J].
Chen, Hao ;
Friedman, Jerome H. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) :397-409
[3]   GRAPH-BASED TESTS FOR TWO-SAMPLE COMPARISONS OF CATEGORICAL DATA [J].
Chen, Hao ;
Zhang, Nancy R. .
STATISTICA SINICA, 2013, 23 (04) :1479-1503
[4]  
Chen LHY, 2005, LECT NOTES SER INST, V4, P1
[5]   Clinical Features of 8295 Patients With Resistant Hypertension Classified on the Basis of Ambulatory Blood Pressure Monitoring [J].
de la Sierra, Alejandro ;
Segura, Julian ;
Banegas, Jose R. ;
Gorostidi, Manuel ;
de la Cruz, Juan J. ;
Armario, Pedro ;
Oliveras, Anna ;
Ruilope, Luis M. .
HYPERTENSION, 2011, 57 (05) :898-U74
[6]   Inferring friendship network structure by using mobile phone data [J].
Eagle, Nathan ;
Pentland, Alex ;
Lazer, David .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (36) :15274-15278
[7]   Is disorganization a feature of schizophrenia or a modifying influence: Evidence of covariation of perceptual and cognitive organization in a non-patient sample [J].
Feigenson, Keith A. ;
Gara, Michael A. ;
Roche, Matthew W. ;
Silverstein, Steven M. .
PSYCHIATRY RESEARCH, 2014, 217 (1-2) :1-8
[8]   MULTIVARIATE GENERALIZATIONS OF THE WALD-WOLFOWITZ AND SMIRNOV 2-SAMPLE TESTS [J].
FRIEDMAN, JH ;
RAFSKY, LC .
ANNALS OF STATISTICS, 1979, 7 (04) :697-717
[10]  
Henze N, 1999, ANN STAT, V27, P290