Warm-Starting for Improving the Novelty of Abstractive Summarization

被引:0
|
作者
Alomari, Ayham [1 ,2 ]
Al-Shamayleh, Ahmad Sami
Idris, Norisma [3 ]
Qalid Md Sabri, Aznul [4 ]
Alsmadi, Izzat [5 ]
Omary, Danah [6 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Dept Comp Sci, Amman 11931, Jordan
[2] Middle East Univ, MEU Res Unit, Amman 11831, Jordan
[3] Al Ahliyya Amman Univ, Dept Data Sci & Artificial Intelligence, Amman 19328, Jordan
[4] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
[5] Texas A&M Univ San Antonio, Dept Comp & Cybersecur, San Antonio, TX 78224 USA
[6] Univ North Texas, Dept Elect Engn, Denton, TX 76210 USA
关键词
Abstractive summarization; novelty; warm-started models; deep learning; metrics;
D O I
10.1109/ACCESS.2023.3322226
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Abstractive summarization is distinguished by using novel phrases that are not found in the source text. However, most previous research ignores this feature in favour of enhancing syntactical similarity with the reference. To improve novelty aspects, we have used multiple warm-started models with varying encoder and decoder checkpoints and vocabulary. These models are then adapted to the paraphrasing task and the sampling decoding strategy to further boost the levels of novelty and quality. In addition, to avoid relying only on the syntactical similarity assessment, two additional abstractive summarization metrics are introduced: 1) NovScore: a new novelty metric that delivers a summary novelty score; and 2) NSSF: a new comprehensive metric that ensembles Novelty, Syntactic, Semantic, and Faithfulness features into a single score to simulate human assessment in providing a reliable evaluation. Finally, we compare our models to the state-of-the-art sequence-to-sequence models using the current and the proposed metrics. As a result, warm-starting, sampling, and paraphrasing improve novelty degrees by 2%, 5%, and 14%, respectively, while maintaining comparable scores on other metrics.
引用
收藏
页码:112483 / 112501
页数:19
相关论文
共 50 条
  • [31] A Relation Enhanced Model For Abstractive Dialogue Summarization
    Yi, Pengyao
    Liu, Ruifang
    2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, : 240 - 246
  • [32] Recurrent neural network for abstractive summarization of documents
    Bansal, Neha
    Sharma, Arun
    Singh, R. K.
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (01) : 65 - 72
  • [33] Abstractive Meeting Summarization as a Markov Decision Process
    Murray, Gabriel
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 212 - 219
  • [34] Highlighted Word Encoding for Abstractive Text Summarization
    Lal, Daisy Monika
    Singh, Krishna Pratap
    Tiwary, Uma Shanker
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 77 - 86
  • [35] Abstractive Document Summarization without Parallel Data
    Nikolov, Nikola, I
    Hahnloser, Richard H. R.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6638 - 6644
  • [36] Towards a New Hybrid Approach for Abstractive Summarization
    Jaafar, Younes
    Bouzoubaa, Karim
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 286 - 293
  • [37] Abstractive Summarization of Broadcast News Stories for Estonian
    Harm, Henry
    Alumae, Tanel
    BALTIC JOURNAL OF MODERN COMPUTING, 2022, 10 (03): : 511 - 524
  • [38] English-Arabic Text Translation and Abstractive Summarization Using Transformers
    Holiel, Heidi Ahmed
    Mohamed, Nancy
    Ahmed, Arwa
    Medhat, Walaa
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [39] KI-HABS: Key Information Guided Hierarchical Abstractive Summarization
    Zhang, Mengli
    Zhou, Gang
    Yu, Wanting
    Liu, Wenfen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (12): : 4275 - 4291
  • [40] ATSSI: Abstractive Text Summarization using Sentiment Infusion
    Bhargava, Rupal
    Sharma, Yashvardhan
    Sharma, Gargi
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 404 - 411