Research on semantic representation and citation recommendation of scientific papers with multiple semantics fusion

被引:12
作者
Lu, Yonghe [1 ]
Yuan, Meilu [1 ]
Liu, Jiaxin [1 ]
Chen, Minghong [1 ]
机构
[1] Sun Yat Sen Univ, Sch Informat Management, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Citation Recommendation; Semantic Representation; Scientific Papers; Multiple Semantics Fusion; SciBERT; WORDNET;
D O I
10.1007/s11192-022-04566-5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the growth in scientific papers, citation recommendation which enables researchers to find useful references efficiently and further to promote academic communication and cooperation has become increasingly important. However, little research has been done to explore how to recognize the semantically relevant references according to research scenarios and the context of the paper citation. Motivated by the research gap, the present study attempts to adopt SciBERT to represent text and expand its semantics through the fusion of WordNet knowledge. Further, core themes from references are automatically extracted by TextRank to solve the problem of incomplete content extraction. In this case, the model named SciBERT + DPCNN is constructed for semantic representation and citation recommendation of scientific papers. Afterwards, multiple experiments are designed and implemented in three parts to verify the effectiveness of the model. The first result is that the outcomes of SciBERT + DPCNN obtain the highest among all baseline models. Additionally, when the model performs in 1 WordNet fusion at the end of the sentence, the best outcomes are 84.72%, 84.80%, 84.72%, and 84.71% in terms of accuracy, precision, recall, and F1-score, respectively. Ultimately, for the classification results of the reference structure, the long text 'title + abstract + TextRank full text (except the title and abstract)' outperforms most short text 'title + abstract' without WordNet fusion. However, when WordNet is fused for the classification, the short text is mostly more accurate than the long text.
引用
收藏
页码:1367 / 1393
页数:27
相关论文
共 71 条
[1]   Global citation recommendation employing generative adversarial network [J].
Ali, Zafar ;
Qi, Guilin ;
Muhammad, Khan ;
Kefalas, Pavlos ;
Khusro, Shah .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 180
[2]   An overview and evaluation of citation recommendation models [J].
Ali, Zafar ;
Ullah, Irfan ;
Khan, Amin ;
Ullah Jan, Asim ;
Muhammad, Khan .
SCIENTOMETRICS, 2021, 126 (05) :4083-4119
[3]   Deep learning in citation recommendation models survey [J].
Ali, Zafar ;
Kefalas, Pavlos ;
Muhammad, Khan ;
Ali, Bahadar ;
Imran, Muhammad .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
[4]   Paper recommendation based on heterogeneous network embedding [J].
Ali, Zafar ;
Qi, Guilin ;
Muhammad, Khan ;
Ali, Bahadar ;
Abro, Waheed Ahmed .
KNOWLEDGE-BASED SYSTEMS, 2020, 210
[5]   A graph-based taxonomy of citation recommendation models [J].
Ali, Zafar ;
Qi, Guilin ;
Kefalas, Pavlos ;
Abro, Waheed Ahmad ;
Ali, Bahadar .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (07) :5217-5260
[6]   A Novel Deep Neural Network-Based Approach to Measure Scholarly Research Dissemination Using Citations Network [J].
Aljohani, Naif Radi ;
Fayoumi, Ayman ;
Hassan, Saeed-Ul .
APPLIED SCIENCES-BASEL, 2021, 11 (22)
[7]  
Azarafza M., 2020, C 3 INT C SCI ENG
[8]  
Beltagy I., 2020, ARXIV
[9]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[10]  
Bhagavatula C., 2018, ARXIV