Topic-sensitive expert finding based solely on heterogeneous academic networks

被引:5
作者
Gao, Xiaonan [1 ,2 ]
Wu, Sen [1 ]
Xia, Dawen [2 ,3 ]
Xiong, Hui [2 ,4 ,5 ]
机构
[1] Univ Sci & Technol Beijing, Sch Econ & Management, Beijing 100083, Peoples R China
[2] Rutgers State Univ, Management Sci & Informat Syst Dept, Newark, NJ 07102 USA
[3] Guizhou Minzu Univ, Coll Data Sci & Informat Engn, Guiyang 550025, Peoples R China
[4] Baidu Inc, Baidu Talent Intelligent Ctr, Beijing 100085, Peoples R China
[5] Baidu Res, Baidu Business Intelligence Lab, Beijing 100085, Peoples R China
基金
中国国家自然科学基金;
关键词
High-order relationship; Common features; Representative features; Expert finding; Academic network;
D O I
10.1016/j.eswa.2022.119241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying experts in a specific research field is an essential and practical task in academia and industry. Although some efforts have been attempted for expert finding, they rely on abundant text mining to reveal the target research topic accurately. Furthermore, it is time-consuming to collect and process these text materials for each candidate, and it is difficult to prepare a precise description for a topic that can cover all sub-topics. To this end, in this study, we develop a new expert finding model called TOpic-Sensitive representative Author identification (TOSA), to find representative authors in a specific research topic from a heterogeneous topic -insensitive academic network involving various topics, with a few well-known experts collected previously who are professional in the target topic. Specifically, we first learn the high-order relationships about authors in the heterogeneous academic network, which contain enough topic-related information. Next, an embedding space is found, where the collected experts are close to each other; in other words, the area they gather indicates the features about the collected authoritative experts. Finally, a prototype is calculated based on the collected experts in the embedding space, and we can rank other unlabeled authors by measuring the closeness between them and the prototype. Along this line, in addition to the collected experts in advance, more authoritative experts in the target topic will be identified. Extensive experiments on DBLP and ACM datasets demonstrate that the proposed TOSA model can identify experts who are professional in the query topic directly from a heterogeneous topic-insensitive academic network without text data processing, and its performance is significantly superior to the baseline models.
引用
收藏
页数:16
相关论文
共 59 条
[1]  
Ali Z., 2021, NEURAL COMPUT APPL
[2]   Paper recommendation based on heterogeneous network embedding [J].
Ali, Zafar ;
Qi, Guilin ;
Muhammad, Khan ;
Ali, Bahadar ;
Abro, Waheed Ahmed .
KNOWLEDGE-BASED SYSTEMS, 2020, 210
[3]   Topic-based heterogeneous rank [J].
Amjad, Tehmina ;
Ding, Ying ;
Daud, Ali ;
Xu, Jian ;
Malic, Vincent .
SCIENTOMETRICS, 2015, 104 (01) :313-334
[4]  
Anjum O, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P518
[5]  
[Anonymous], 2015, Advances in independent component analysis and learning machines
[6]  
[Anonymous], 2012, P 12 ACMIEEE CS JOIN
[7]  
[Anonymous], 2010, ICML
[8]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787