TALKSUMM: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks

被引:0
|
作者
Lev, Guy [1 ]
Shmueli-Scheuer, Michal [1 ]
Herzig, Jonathan [1 ]
Jerbi, Achiya [1 ]
Konopnicki, David [1 ]
机构
[1] IBM Res, Haifa, Israel
来源
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019) | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, no large-scale training data is available for the task of scientific paper summarization. In this paper, we propose a novel method that automatically generates summaries for scientific papers, by utilizing videos of talks at scientific conferences. We hypothesize that such talks constitute a coherent and concise description of the papers' content, and can form the basis for good summaries. We collected 1716 papers and their corresponding videos, and created a dataset of paper summaries. A model trained on this dataset achieves similar performance as models trained on a dataset of summaries created manually. In addition, we validated the quality of our summaries by human experts.
引用
收藏
页码:2125 / 2131
页数:7
相关论文
共 10 条
  • [1] Event-based summarization method for scientific literature
    Zhang, Junsheng
    Li, Kun
    Yao, Changqing
    Sun, Yunchuan
    PERSONAL AND UBIQUITOUS COMPUTING, 2021, 25 (06) : 959 - 968
  • [2] Event-based summarization method for scientific literature
    Junsheng Zhang
    Kun Li
    Changqing Yao
    Yunchuan Sun
    Personal and Ubiquitous Computing, 2021, 25 : 959 - 968
  • [3] Semantic Annotation Model and Method Based on Internet Open Dataset
    Gao, Xin
    Wang, Yansong
    Wang, Fang
    Zhang, Baoqun
    Hu, Caie
    Wang, Jian
    Ma, Longfei
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2025, 21 (01)
  • [4] Influence Visualization of Scientific Paper Through Flow-based Citation Network Summarization
    Su, Yue
    Sun, Sibai
    Xuan, Yuan
    Shi, Lei
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1652 - 1655
  • [5] Dataset construction method of cross-lingual summarization based on filtering and text augmentation
    Pan H.
    Xi Y.
    Wang L.
    Nan Y.
    Su Z.
    Cao R.
    PeerJ Computer Science, 2023, 9
  • [6] Dataset construction method of cross-lingual summarization based on filtering and text augmentation
    Pan, Hangyu
    Xi, Yaoyi
    Wang, Ling
    Nan, Yu
    Su, Zhizhong
    Cao, Rong
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [7] Study on the Scientific Assessment Method of Test Paper Based on Probability Theory
    Song, Chen
    Zhai, Yu-Xiao
    Zhang, Wei
    2013 2ND INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND EDUCATION (ICSSE 2013), PT 3, 2013, 48 : 215 - 221
  • [8] Constrained Region Selection Method Based on Configuration Space for Visualization in Scientific Dataset Search
    Takeuchi, Shin'ichi
    Sugiura, Komei
    Akahoshi, Yuhei
    Zettsu, Koji
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2191 - 2200
  • [9] A method of annotation extraction from paper documents using alignment based on local arrangements of feature points
    Nakai, Tornohiro
    Kise, Koichi
    Iwamura, Masakazu
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 23 - 27
  • [10] scADCA: An Anomaly Detection-Based scRNA-seq Dataset Cell Type Annotation Method for Identifying Novel Cells
    Shi, Yongle
    Ma, Yibing
    Chen, Xiang
    Gao, Jie
    CURRENT BIOINFORMATICS, 2024,