A Brief Survey of Automatic Methods for Author Name Disambiguation

被引:185
作者
Ferreira, Anderson A. [1 ,2 ]
Goncalves, Marcos Andre [2 ]
Laender, Alberto H. F. [2 ]
机构
[1] Univ Fed Ouro Preto, Dept Comp, BR-35400000 Ouro Preto, Brazil
[2] Univ Fed Minas Gerais, Dept Ciencia Comp, BR-31270901 Belo Horizonte, MG, Brazil
关键词
RESOLUTION; MODEL;
D O I
10.1145/2350036.2350040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Name ambiguity in the context of bibliographic citation records is a hard problem that affects the quality of services and content in digital libraries and similar systems. The challenges of dealing with author name ambiguity have led to a myriad of disambiguation methods. Generally speaking, the proposed methods usually attempt to group citation records of a same author by finding some similarity among them or try to directly assign them to their respective authors. Both approaches may either exploit supervised or unsupervised techniques. In this article, we propose a taxonomy for characterizing the current author name disambiguation methods described in the literature, present a brief survey of the most representative ones and discuss several open challenges.
引用
收藏
页码:15 / 26
页数:12
相关论文
共 51 条
[1]  
[Anonymous], J DATA INF QUAL
[2]  
[Anonymous], 1998, PROC 1 LANGUAGE RESO
[3]  
[Anonymous], 2005, Data Mining: Concepts and Techniques
[4]  
[Anonymous], 2005, WWW '05
[5]  
[Anonymous], 2011, Journal of Information and Data Management
[6]  
[Anonymous], 2010, P 10 ANN JOINT C DIG, DOI 10.1145/1816123.1816130
[7]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[8]  
[Anonymous], 2003, IIWeb
[9]  
Bhattacharya I., 2007, ACM TKDD, V1
[10]  
Bhattacharya I., 2006, SDM