Author name disambiguation: What difference does it make in author-based citation analysis?

被引:99
作者
Strotmann, Andreas [1 ]
Zhao, Dangzhi [2 ]
机构
[1] GESIS Leibniz Inst Social Sci, D-50667 Cologne, Germany
[2] Univ Alberta, Sch Lib & Informat Studies, Edmonton, AB T6G 2J4, Canada
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2012年 / 63卷 / 09期
关键词
bibliometrics; co-citation analysis; citation analysis; INFORMATION-SCIENCE; COCITATION;
D O I
10.1002/asi.22695
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we explore how strongly author name disambiguation (AND) affects the results of an author-based citation analysis study, and identify conditions under which the traditional simplified approach of using surnames and first initials may suffice in practice. We compare author citation ranking and cocitation mapping results in the stem cell research field from 2004 to 2009 using two AND approaches: the traditional simplified approach of using author surname and first initial and a sophisticated algorithmic approach. We find that the traditional approach leads to extremely distorted rankings and substantially distorted mappings of authors in this field when based on first- or all-author citation counting, whereas last-author-based citation ranking and cocitation mapping both appear relatively immune to the author name ambiguity problem. This is largely because Romanized names of Chinese and Korean authors, who are very active in this field, are extremely ambiguous, but few of these researchers consistently publish as last authors in bylines. We conclude that a more earnest effort is required to deal with the author name ambiguity problem in both citation analysis and information retrieval, especially given the current trend toward globalization. In the stem cell research field, in which laboratory heads are traditionally listed as last authors in bylines, last-author-based citation ranking and cocitation mapping using the traditional approach to author name disambiguation may serve as a simple workaround, but likely at the price of largely filtering out Chinese and Korean contributions to the field as well as important contributions by young researchers.
引用
收藏
页码:1820 / 1833
页数:14
相关论文
共 23 条
  • [1] Andrade M., 2006, WORKSH SCH DAT DAT I
  • [2] Credit where credit is due
    不详
    [J]. NATURE, 2009, 462 (7275) : 825 - 825
  • [3] [Anonymous], CITATION ANAL RES EV
  • [4] Commercialization and Collaboration: Competing Policies in Publicly Funded Stem Cell Research?
    Bubela, Tania
    Strotmann, Andreas
    Adams, Rhiannon
    Morrison, Shawn
    [J]. CELL STEM CELL, 2010, 7 (01) : 25 - 30
  • [5] An Unsupervised Heuristic-Based Hierarchical Method for Name Disambiguation in Bibliographic Citations
    Cota, Ricardo G.
    Ferreira, Anderson A.
    Nascimento, Cristiano
    Goncalves, Marcos Andre
    Laender, Alberto H. F.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (09): : 1853 - 1870
  • [6] Name disambiguation spectral in author citations using a K-way clustering method
    Han, H
    Zha, HY
    Giles, CL
    [J]. PROCEEDINGS OF THE 5TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS, 2005, : 334 - 343
  • [7] A day in the life of PubMed: Analysis of a typical day's query log
    Herskovic, Jorge R.
    Tantaka, Len Y.
    Hersh, William
    Bernstam, Elmer V.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2007, 14 (02) : 212 - 220
  • [8] On co-authorship for author disambiguation
    Kang, In-Su
    Na, Seung-Hoon
    Lee, Seungwoo
    Jung, Hanmin
    Kim, Pyung
    Sung, Won-Kyung
    Lee, Jong-Hyeok
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (01) : 84 - 97
  • [9] Scientific publishing: Identity crisis
    Qiu, Jane
    [J]. NATURE, 2008, 451 (7180) : 766 - 767
  • [10] Scopus, 2009, SCOP AUTH ID