Online author name disambiguation in evolving digital library

被引:2
作者
Pooja, K. M. [1 ]
Mondal, Samrat [1 ]
Chandra, Joydeep [1 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, India
关键词
Author name disambiguation; Dynamic graph embedding; Digital library; Academic social network;
D O I
10.1016/j.neucom.2021.07.104
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Name ambiguity is a prevalent problem in digital library domain where mapping of bibliographic records to authors isa major issue. The unprecedented growth of the bibliographic records and absence of unique identifiers are further exacerbating the problem. Specifically, name ambiguity affects various bibliometric analysis tasks that include record management as well as scientific assessment of the authors thereby necessitating the name disambiguation. The name disambiguation task is to assign the records, possibly with the ambiguous authorship, to corresponding authors. While existing techniques are good at extract-ing abstract features from set of records with a common author name that can be subsequently used for clustering the records based on unique author identities, however, such techniques usually perform poorly in disambiguating isolated individual record entries that arrive continuously. Disambiguation of only newly arrived records, rather than the whole records of the digital library is challenging, however, computationally rewarding and thus, not only preferable but becoming the necessity due to tremendous growth in the number of bibliographic records with the time, which is likely to continue. In this regard, we propose an online author name disambiguation approach for evolving digital library. Our approach involves representation learning of records in an online manner in evolving (academic networks) digital library using dynamic graph embedding and clustering of latent representation of records. We show the use of our online name disambiguation method in batch setting (for static or initial records of digital library) and incremental setting (for new records of digital library). Significant improvement, over exist-ing state-of-the-art methods in terms of various evaluation metrics, has been observed which indicates the effectiveness of the proposed approach.(c) 2022 Published by Elsevier B.V.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 45 条
[11]  
Francq P, 2011, COLLABORATIVE SEARCH AND COMMUNITIES OF INTEREST: TRENDS IN KNOWLEDGE SHARING AND ASSESSMENT, P98, DOI 10.4018/978-1-61520-841-8.ch006
[12]  
Hamilton WL, 2017, ADV NEUR IN, V30
[13]   Two supervised learning approaches for name disambiguation in author citations [J].
Han, H ;
Giles, L ;
Zha, H ;
Li, C ;
Tsioutsiouliklis, K .
JCDL 2004: PROCEEDINGS OF THE FOURTH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES: GLOBAL REACH AND DIVERSE IMPACT, 2004, :296-305
[14]  
Jaccard P., 1901, Bull Soc Vaudoise Sci Nat, V37, P241
[15]   Online Person Name Disambiguation with Constraints [J].
Khabsa, Madian ;
Treeratpituk, Pucktada ;
Giles, C. Lee .
PROCEEDINGS OF THE 15TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL'15), 2015, :37-46
[16]   A fast and integrative algorithm for clustering performance evaluation in author name disambiguation [J].
Kim, Jinseok .
SCIENTOMETRICS, 2019, 120 (02) :661-681
[17]  
Kipf T.N., 2017, INT C LEARN REPR ICL
[18]  
Lapidot I, 2002, SELF ORGANIZING MAPS
[19]   Online Chinese Restaurant Process [J].
Liu, Chien-Liang ;
Tsai, Tsung-Hsun ;
Lee, Chia-Hoang .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :591-600
[20]   Fuzzy-aided solution for out-of-view challenge in visual tracking under IoT-assisted complex environment [J].
Liu, Shuai ;
Liu, Xinyu ;
Wang, Shuai ;
Muhammad, Khan .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04) :1055-1065