A review of name-based ethnicity classification methods and their potential in population studies

Cited by: 119
Author
Mateos, Pablo
Affiliations
[1] UCL, Dept Geog, London WC1E 6BT, England
[2] UCL, Ctr Adv Spatial Anal, London WC1E 6BT, England
Keywords
name origins; ethnicity classifications; identity measurement; interdisciplinary methods; surnames;
DOI
10.1002/psp.457
Chinese Library Classification number
C921 [Demography];
Subject classification code
Abstract
Several approaches have been proposed to classify populations into ethnic groups using people's names, as an alternative to ethnicity self-identification information when this is not available. These methodologies have been developed primarily in the public health and population genetics literature of different countries, in isolation from, and with little participation by, demographers or social scientists. The objective of this paper is to bring together these isolated efforts and provide a coherent comparison, a common methodology and terminology in order to foster new research and applications in this promising and multidisciplinary field. A systematic review has been conducted of the most representative studies that develop new name-based ethnicity classifications, extracting methodological commonalities, achievements and shortcomings; 13 studies met the inclusion criteria, and all followed a very similar methodology to create a name reference list with which to classify populations into a few of the most common ethnic groups. The different classifications' sensitivity varies between 0.67 and 0.95, their specificity between 0.80 and 1, their positive predictive value between 0.70 and 0.96, and their negative predictive value between 0.96 and 1. Name-based ethnicity classification systems have great potential to overcome data scarcity issues in a wide variety of key topics in population studies, as the 13 papers analysed demonstrate. Their current limitations are mainly due to a restricted number of names and a partial spatio-temporal coverage of the reference population datasets used to produce name reference lists. Improved classifications with extensive population coverage and higher classification accuracy levels will be achieved by using population registers with wider spatio-temporal coverage. Furthermore, such new classifications need to include all of the potential ethnic groups present in a society, and not just one or a few of them.
Copyright (c) 2007 John Wiley & Sons, Ltd.
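The four accuracy measures quoted in the abstract can be illustrated with a minimal sketch. It assumes a binary setting (one target group against everyone else) and a reference list that is simply a set of lower-cased surnames; the names `classify`, `evaluate`, `reference_list` and the toy records are hypothetical placeholders, not data or code from the reviewed studies.

```python
def classify(surname, reference_list):
    """Predict target-group membership by looking the surname up in the reference list."""
    return surname.strip().lower() in reference_list


def ratio(numerator, denominator):
    """Return numerator / denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None


def evaluate(records, reference_list):
    """Compare name-based predictions with self-reported membership.

    records: iterable of (surname, is_member) pairs, where is_member is the
    self-identified (gold standard) membership of the target group.
    """
    tp = fp = fn = tn = 0
    for surname, is_member in records:
        predicted = classify(surname, reference_list)
        if predicted and is_member:
            tp += 1  # correctly assigned to the target group
        elif predicted and not is_member:
            fp += 1  # name on the list, person not in the group
        elif not predicted and is_member:
            fn += 1  # group member whose name is missing from the list
        else:
            tn += 1  # correctly left out of the target group
    return {
        "sensitivity": ratio(tp, tp + fn),  # share of true members detected
        "specificity": ratio(tn, tn + fp),  # share of non-members excluded
        "ppv": ratio(tp, tp + fp),          # positive predictive value
        "npv": ratio(tn, tn + fn),          # negative predictive value
    }


# Toy, purely synthetic example (placeholder surnames, not real data).
reference_list = {"alpha", "beta", "gamma"}
records = [
    ("Alpha", True), ("beta", True), ("Delta", True),
    ("gamma", False), ("epsilon", False), ("zeta", False),
    ("eta", False), ("theta", False),
]
metrics = evaluate(records, reference_list)
```

In this sketch a short reference list lowers sensitivity (members whose names are absent become false negatives), which mirrors the limitation the abstract attributes to a restricted number of names.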
Pages: 243-263
Number of pages: 21