Automatic generation of related work through summarizing citations

被引:33
作者
Chen, Jingqiang [1 ,4 ]
Hai Zhuge [1 ,2 ,3 ,4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing, Jiangsu, Peoples R China
[2] Guangzhou Univ, Guangzhou, Guangdong, Peoples R China
[3] Aston Univ, Birmingham, W Midlands, England
[4] Univ Chinese Acad Sci, Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
citation; related work generation; Summarization; EXTRACTION;
D O I
10.1002/cpe.4261
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Related work is a component of a scientific paper, which introduces other researchers' relevant works and makes comparisons with the current author's work. Automatically generating the related work section of a writing paper provides a tool for researchers to accomplish the related work section efficiently without missing related works. This paper proposes an approach to automatically generating a related work section by comparing the main text of the paper being written with the citations of other papers that cite the same references. Our approach first collects the papers that cite the reference papers of the paper being written and extracts the corresponding citation sentences to form a citation document. It then extracts keywords from the citation document and the paper being written and constructs a graph of the keywords. Once the keywords that discriminate the two documents are determined, the minimum Steiner tree that covers the discriminative keywords and the topic keywords is generated. The summary is generated by extracting the sentences covering the Steiner tree. According to ROUGE evaluations, the experiments show that the citations are suitable for related work generation and our approach outperforms the three baseline methods of MEAD, LexRank, and ReWoS. This work verifies the general summarization method based on connotation and extension through citation.
引用
收藏
页数:12
相关论文
共 45 条
  • [1] [Anonymous], 2016, MULTIDIMENSIONAL SUM
  • [2] Extractive summarization of multi-party meetings through discourse segmentation
    Bokaei, Mohammad Hadi
    Sameti, Hossein
    Liu, Yang
    [J]. NATURAL LANGUAGE ENGINEERING, 2016, 22 (01) : 41 - 72
  • [3] Chen JQ, 2014, INT CONF SEMANT, P108, DOI 10.1109/SKG.2014.16
  • [4] Summarization of scientific documents by detecting common facts in citations
    Chen, Jingqiang
    Zhuge, Hai
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 32 : 246 - 252
  • [5] Chen Y, 2011, INT C COMP AID IND D, P933
  • [6] Chen YN, 2013, SPOKEN LANGUAGE TECH, P461
  • [7] Cheng SW, 2012, J ACAD LANG LEARN, V6, pA1
  • [8] FLUX-CiM: Flexible Unsupervised Extraction of Citation Metadata
    Cortez, Eli
    da Silva, Altigran S.
    Goncalves, Marcos Andre
    Mesquita, Filipe
    de Moura, Edleno S.
    [J]. PROCEEDINGS OF THE 7TH ACM/IEE JOINT CONFERENCE ON DIGITAL LIBRARIES: BUILDING & SUSTAINING THE DIGITAL ENVIRONMENT, 2007, : 215 - +
  • [9] Reference metadata extraction using a hierarchical knowledge representation framework
    Day, Min-Yuh
    Tsai, Richard Tzong-Han
    Sung, Cheng-Lung
    Hsieh, Chiu-Chen
    Lee, Cheng-Wei
    Wu, Shih-Hung
    Wu, Kun-Pin
    ong, Chorng-Shy Ong
    Hsu, Wen-Lian
    [J]. DECISION SUPPORT SYSTEMS, 2007, 43 (01) : 152 - 167
  • [10] Day MY, 2005, PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, P50