OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services

被引:18
作者
Liu, Xiao [1 ]
Yin, Da [1 ]
Zheng, Jingnan
Zhang, Xingjian [1 ,2 ]
Zhang, Peng [3 ]
Yang, Hongxia [4 ]
Dong, Yuxiao [1 ]
Tang, Jie [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Zhipu AI, Beijing, Peoples R China
[4] Alibaba Grp, DAMO Acad, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
基金
美国国家科学基金会;
关键词
Pre-Training; Language Model; Heterogeneous Knowledge Graph; RECOMMENDATION;
D O I
10.1145/3534678.3539210
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Academic knowledge services have substantially facilitated the development of the science enterprise by providing a plenitude of efficient research tools. However, many applications highly depend on ad-hoc models and expensive human labeling to understand scientific contents, hindering deployments into real products. To build a unified backbone language model for different knowledge-intensive academic applications, we pre-train an academic language model OAG-BERT that integrates both the heterogeneous entity knowledge and scientific corpora in the Open Academic Graph (OAG)-the largest public academic graph to date. In OAG-BERT, we develop strategies for pre-training text and entity data along with zero-shot inference techniques. OAG-BERT achieves outperformance over baselines on nine academic tasks including two demo applications, demonstrating its potential to serve as one foundation model for academic knowledge services. Its zero-shot capability furthers the path to mitigate the need of expensive annotations. OAG-BERT has been deployed for real-world applications, such as the reviewer recommendation function for National Nature Science Foundation of China (NSFC)-one of the largest funding agencies in China-and paper tagging in AMiner (https://www.aminer.cn). All codes and pre-trained models are available via the CogDL toolkit(1).
引用
收藏
页码:3418 / 3428
页数:11
相关论文
共 55 条
[1]   Recommendation of scholarly venues based on dynamic user interests [J].
Alhoori, Hamed ;
Furuta, Richard .
JOURNAL OF INFORMETRICS, 2017, 11 (02) :553-563
[2]  
Ammar Waleed, 2018, NAACL-HLT, V3, P84, DOI 10.18653/v1/n18-3011
[3]  
[Anonymous], 2017, International Studies
[4]  
[Anonymous], 2006, Proceedings of the Seventeenth Conference on Hypertext and Hypermedia, DOI DOI 10.1145/1149941.1149949
[5]  
[Anonymous], 2010, P 10 ANN JOINT C DIG, DOI DOI 10.1145/1816123.1816129
[6]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[7]   Supervised Machine Learning applied to Link Prediction in Bipartite Social Networks [J].
Benchettara, Nesserine ;
Kanawati, Rushed ;
Rouveirol, Celine .
2010 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2010), 2010, :326-330
[8]  
Brown T. B., 2020, Advances in Neural Information Processing Systems
[9]  
Cen Yukuo, 2021, ARXIV210300959
[10]  
Chen Bo, 2020, TKDE