MODELING THE VARIABILITY OF RANKINGS

Cited by: 18
Authors
Hall, Peter [1]
Miller, Hugh [1]
Affiliations
[1] Univ Melbourne, Dept Math & Stat, Melbourne, Vic 3010, Australia
Keywords
Bootstrap; coverage; exponential distribution; exponential tails; extreme values; order statistics; Pareto distribution; performance rankings; regularly varying tails
DOI
10.1214/10-AOS794
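Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract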
For better or for worse, rankings of institutions, such as universities, schools and hospitals, play an important role today in conveying information about relative performance. They inform policy decisions and budgets, and are often reported in the media. While overall rankings can vary markedly over relatively short time periods, it is not unusual to find that the ranks of a small number of "highly performing" institutions remain fixed, even when the data on which the rankings are based are extensively revised, and even when a large number of new institutions are added to the competition. In the present paper, we endeavor to model this phenomenon. In particular, we interpret as a random variable the value of the attribute on which the ranking should ideally be based. More precisely, if $p$ items are to be ranked then the true, but unobserved, attributes are taken to be values of $p$ independent and identically distributed variates. However, each attribute value is observed only with noise, and via a sample of size roughly equal to $n$, say. These noisy approximations to the true attributes are the quantities that are actually ranked. We show that, if the distribution of the true attributes is light-tailed (e.g., normal or exponential) then the number of institutions whose ranking is correct, even after recalculation using new data and even after many new institutions are added, is essentially fixed. Formally, $p$ is taken to be of order $n^C$ for any fixed $C > 0$, and the number of institutions whose ranking is reliable depends very little on $p$. On the other hand, cases where the number of reliable rankings increases significantly when new institutions are added are those for which the distribution of the true attributes is relatively heavy-tailed, for example, with tails that decay like $x^{-\alpha}$ for some $\alpha > 0$. These properties and others are explored analytically, under general conditions. A numerical study links the results to outcomes for real-data problems.
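The setup described in the abstract can be explored with a small simulation. The sketch below is illustrative only and is not the authors' numerical study. It makes several assumptions that do not come from the abstract: the noise is Gaussian with standard deviation 1/sqrt(n) (as if each observed value were the mean of n unit-variance measurements), "reliable" is read as the number of consecutive top ranks on which the noisy ranking agrees with the true ranking, and the function names and default parameters are hypothetical.

```python
import random


def count_stable_top_ranks(p, n, tail="exponential", alpha=2.0, seed=0):
    """Count how many leading ranks the noisy ranking reproduces exactly.

    p     : number of items being ranked
    n     : nominal sample size behind each observed value
    tail  : "exponential" (light tail) or "pareto" (tail decaying like x^-alpha)
    alpha : Pareto tail index, used only when tail == "pareto"
    """
    rng = random.Random(seed)

    # True, unobserved attributes: p independent draws from one distribution.
    if tail == "exponential":
        true_values = [rng.expovariate(1.0) for _ in range(p)]
    elif tail == "pareto":
        true_values = [rng.paretovariate(alpha) for _ in range(p)]
    else:
        raise ValueError("tail must be 'exponential' or 'pareto'")

    # Assumption: observation noise is N(0, 1/n), i.e. the error of a mean
    # of n unit-variance measurements. The abstract does not fix this form.
    noise_sd = 1.0 / n ** 0.5
    observed = [t + rng.gauss(0.0, noise_sd) for t in true_values]

    # Rank items from largest to smallest, by true and by observed value.
    true_order = sorted(range(p), key=lambda i: true_values[i], reverse=True)
    observed_order = sorted(range(p), key=lambda i: observed[i], reverse=True)

    # Count the leading positions on which both rankings name the same item.
    stable = 0
    for true_item, observed_item in zip(true_order, observed_order):
        if true_item != observed_item:
            break
        stable += 1
    return stable


def mean_stable_top_ranks(p, n, tail, reps=200):
    """Average the count over `reps` independent replications (seeds 0..reps-1)."""
    total = sum(count_stable_top_ranks(p, n, tail, seed=s) for s in range(reps))
    return total / reps
```

One way to use it is to hold n fixed and compare, for example, `mean_stable_top_ranks(100, 25, "exponential")` with `mean_stable_top_ranks(1000, 25, "exponential")`, then repeat with `"pareto"`. The abstract's claims are asymptotic, so a finite simulation of this kind may only loosely reflect them.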
Pages: 2652-2677
Page count: 26