A Simple Semantics and Topic-aware Method to Enhance Abstractive Summarization

被引:0
|
作者
Du, Jiangnan [1 ]
Fu, Xuan [2 ]
Li, Jianfeng [1 ]
Hou, Cuiqin [1 ]
Zhou, Qiyu [1 ]
Zheng, Hai-Tao [2 ,3 ]
机构
[1] Ping Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
[2] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
abstractive summarization; Transformer; semantic information; topic information;
D O I
10.1109/IJCNN54540.2023.10191441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Transformer-based abstractive summarization models have been proven to be effective and widely used in multiple domains. However, most of these Transformer-based methods are based on maximum likelihood estimation and they still focus on token-level optimization. In addition, the generated summary should have the same semantics as the gold summary. Therefore, we propose a new abstractive summarization model in this paper. Specifically, we optimize the traditional model from the semantic perspective, so that the semantics of the generated summary is more similar to that of the gold summary. We also add topic information so that more key information in the text can be retained in the model. We prove in the CNN/DailyMail dataset that the method proposed in this paper greatly improve the classic abstractive model based on BART, and achieve SOTA results in the SAMSUM dataset.
引用
收藏
页数:7
相关论文
共 27 条
  • [1] A novel abstractive summarization model based on topic-aware and contrastive learning
    Tang, Huanling
    Li, Ruiquan
    Duan, Wenhao
    Dou, Quansheng
    Lu, Mingyu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 5563 - 5577
  • [2] Integrating Topic-Aware Heterogeneous Graph Neural Network With Transformer Model for Medical Scientific Document Abstractive Summarization
    Khaliq, Ayesha
    Khan, Atif
    Awan, Salman Afsar
    Jan, Salman
    Umair, Muhammad
    Zuhairi, Megat F.
    IEEE ACCESS, 2024, 12 : 113855 - 113866
  • [3] A Topic-aware Summarization Framework with Different Modal Side Information
    Chen, Xiuying
    Li, Mingzhe
    Gao, Shen
    Cheng, Xin
    Yang, Qiang
    Zhang, Qishen
    Gao, Xin
    Zhang, Xiangliang
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1416 - 1425
  • [4] Topic Attentional Neural Network for Abstractive Document Summarization
    Liu, Hao
    Zheng, Hai-Tao
    Wang, Wei
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II, 2019, 11440 : 70 - 81
  • [5] Key phrase aware transformer for abstractive summarization
    Liu, Shuaiqi
    Cao, Jiannong
    Yang, Ruosong
    Wen, Zhiyuan
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (03)
  • [6] Legal public opinion news abstractive summarization by incorporating topic information
    Yuxin Huang
    Zhengtao Yu
    Junjun Guo
    Zhiqiang Yu
    Yantuan Xian
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 2039 - 2050
  • [7] CATS: Customizable Abstractive Topic-based Summarization
    Bahrainian, Seyed Ali
    Zerveas, George
    Crestani, Fabio
    Eickhoff, Carsten
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (01)
  • [8] Legal public opinion news abstractive summarization by incorporating topic information
    Huang, Yuxin
    Yu, Zhengtao
    Guo, Junjun
    Yu, Zhiqiang
    Xian, Yantuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2039 - 2050
  • [9] Keyword-based Augmentation Method to Enhance Abstractive Summarization for Legal Documents
    Huyen Nguyen
    Ding, Junhua
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 437 - 441
  • [10] Domain-Aware Abstractive Text Summarization for Medical Documents
    Gigioli, Paul
    Sagar, Nikhita
    Voyles, Joseph
    Rao, Anand
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1155 - 1162