An Example of Empirical Approach for Bibliographic Record Linkage

Authors
Knyazeva, Anna [1]
Kolobov, Oleg [2]
Turchanovsky, Igor [1]
Affiliations
[1] Tomsk Polytech Univ, Inst Computat Technol, Tomsk, Russia
[2] Inst High Current Elect, Tomsk, Russia
Source
2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS), 2016
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The record linkage problem is considered as applied to bibliographic and authority data. The problem commonly arises when data from several libraries are merged. Two approaches based on empirical analysis of the data are tested. Both use indirect information about a person. The proposed variant of the decision tree method makes it possible to handle inconsistent bibliographic data and to apply particular rules one by one to improve record linkage quality. The study was performed on data from several Russian libraries. The data are in RUSMARC format, a variant of UNIMARC widely used in Russia.
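The idea of applying particular rules one by one can be illustrated with a minimal sketch. It is not the authors' implementation: the record layout (`surname`, `birth_year`, `titles`), the three rules, and their order are hypothetical, chosen only to show how a rule cascade can skip missing fields and fall back on indirect information about a person.

```python
from typing import Callable, List, Optional

# Hypothetical flat record, e.g. values already extracted from RUSMARC fields.
Record = dict
# A rule returns True (same person), False (different), or None (undecided).
Rule = Callable[[Record, Record], Optional[bool]]


def normalize(text: Optional[str]) -> str:
    """Lowercase, drop periods, collapse whitespace; missing values become ''."""
    return " ".join((text or "").lower().replace(".", " ").split())


def rule_surname(a: Record, b: Record) -> Optional[bool]:
    sa, sb = normalize(a.get("surname")), normalize(b.get("surname"))
    if not sa or not sb:
        return None  # missing data: leave the decision to later rules
    # Different surnames reject the pair; equal surnames alone decide nothing.
    return None if sa == sb else False


def rule_birth_year(a: Record, b: Record) -> Optional[bool]:
    ya, yb = a.get("birth_year"), b.get("birth_year")
    if ya is None or yb is None:
        return None
    return ya == yb


def rule_shared_titles(a: Record, b: Record) -> Optional[bool]:
    # Indirect information: a work attributed to both records.
    ta = {normalize(t) for t in a.get("titles", [])} - {""}
    tb = {normalize(t) for t in b.get("titles", [])} - {""}
    return True if ta & tb else None


RULES: List[Rule] = [rule_surname, rule_birth_year, rule_shared_titles]


def link(a: Record, b: Record, rules: List[Rule] = RULES) -> bool:
    """Apply rules in order; the first rule that reaches a verdict decides."""
    for rule in rules:
        verdict = rule(a, b)
        if verdict is not None:
            return verdict
    return False  # no rule could decide: conservatively do not link
```

Because each rule may abstain, a record with a missing birth year can still be linked through a shared title, and adding or reordering rules does not require changing `link`.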
Pages: 421-426
Page count: 6