Some results on the maximal correlation in 2 x k contingency tables

被引：6

作者：

Gautam, S ^{[1
]}

Kimeldorf, G

机构：

[1] Vanderbilt Univ, Sch Med, Dept Prevent Med, Div Biostat,Med Ctr N A1124, Nashville, TN 37232 USA

[2] Univ Texas, Richardson, TX 75083 USA

来源：

AMERICAN STATISTICIAN | 1999年 / 53卷 / 04期

关键词：

dual scaling; nominal categorical data; optimal scaling;

D O I：

10.2307/2686053

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

For 2 x k contingency tables, we consider the statistic r*, the maximal correlation between the row and column variables, where the maximum is taken over all possible sets of scores (or "scales" or "weights") assigned to the k categories. For general m x k contingency tables, methods involving the maximization over sets of scores assigned to the categories (called dual-scaling methods) have been criticized for lack of statistical interpretation and for difficulty of computation. For the case m = 2, however, where nominal categorical data on two populations are compared, this article shows that r* has meaningful interpretations as a multiple correlation coefficient, as a numerical measure of association, and as an upper bound on correlation for reduced tables. These interpretations lead to a better understanding of the nature of the association between the two variables. These interpretations also yield insight into the role of the usual chi-square statistic for 2 x k tables. Furthermore, both r" and the set of scores at which this maximum is achieved are shown to have simple closed-form expressions. These scores are used to furnish a simple proof that the asymptotic distribution of nr*(2), based on a sample of size n, is a chi(2) distribution with k - 1 degrees of freedom.

引用

页码：336 / 341

页数：6

共 15 条

[1] Agresti A., 1990, Analysis of categorical data
[2] AITKIN M, 1982, J ROYAL STAT SOC A, V145, P513
[3] [Anonymous], 1973, STAT METHODS RES WOR
[4] Bishop M.M., 1975, DISCRETE MULTIVARIAT
[5] Freeman DH, 1987, Applied categorical data analysis
[6] Optimized scorings for ordinal data for the general linear model
Gautam, S
Kimeldorf, G
Sampson, AR
[J]. STATISTICS & PROBABILITY LETTERS, 1996, 27 (03) : 231 - 239
[7] Goodman LA, 1979, MEASURES ASS CROSS C, P19792
[8] CHOICE OF COLUMN SCORES FOR TESTING INDEPENDENCE IN ORDERED 2XK CONTINGENCY-TABLES
GRAUBARD, BI
KORN, EL
[J]. BIOMETRICS, 1987, 43 (02) : 471 - 476
[9] TESTS FOR INDEPENDENCE IN 2-WAY CONTINGENCY-TABLES BASED ON CANONICAL CORRELATION AND ON LINEAR-BY-LINEAR INTERACTION
HABERMAN, SJ
[J]. ANNALS OF STATISTICS, 1981, 9 (06) : 1178 - 1186
[10] HELMES E, 1986, J CLIN PSYCHOL, V42, P569, DOI 10.1002/1097-4679(198607)42:4<569::AID-JCLP2270420405>3.0.CO

← 1 2 →