Research on abstractive text summarization based on triplet information guidance

被引:0
作者
Zhang, Yunzuo [1 ,2 ]
Li, Yi [1 ]
机构
[1] School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang
[2] Key Laboratory of Hebei Province on Electromagnetic Environmental Effects and Information Processing, Shijiazhuang
来源
Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics | 2024年 / 50卷 / 12期
基金
中国国家自然科学基金;
关键词
abstractive text summarization; fact consistency; fact fusion mechanism; Transformer; triples;
D O I
10.13700/j.bh.1001-5965.2022.0896
中图分类号
学科分类号
摘要
This study introduces the text summarizing model SPOATS, which is driven by fact triples, to solve the issue that the current abstractive text summarization models do not entirely exploit the factual information of text in decoding. The model is based on a Transformer structure, which contains a double encoder capable of extracting facts and a decoder for combining factual features. To begin, the LTP-BiLSTM-GAT (LBiG) model is built and paired with the optimal factual triple selection technique suggested in this paper. The optimal factual triples are then extracted from the unstructured Chinese text to acquire the feature encoding of factual information. Then, the improved S-BERT model is used to represent the original text at the sentence-level vector to obtain the semantic rich sentence encoding. Finally, an attention-based fact fusion mechanism is designed to fuse the dual-encoding features, which can improve the ability of the model to select factual information in the decoding stage. The experimental results show that the proposed model has improved the value of ERPG by 2.0% compared to the baseline model on the dataset LCSTS, and the summary quality has been significantly improved. © 2024 Beijing University of Aeronautics and Astronautics (BUAA). All rights reserved.
引用
收藏
页码:3677 / 3685
页数:8
相关论文
共 37 条
  • [1] REINSEL D, GANTZ J, RYDNING J., Data age 2025, (2017)
  • [2] WAZERY Y M, SALEH M E, ALHARBI A, Et al., Abstractive Arabic text summarization based on deep learning, Computational Intelligence and Neuroscience, 2022, (2022)
  • [3] DENG L, HU P, LI X H., Abstractiing biomedical documents with knowledge enhancement, Data Analysis and Knowledge Discovery, 6, 11, pp. 1-12, (2022)
  • [4] YAN W Y, GUO J J, YU Z T, Et al., Case propensity extract summarization based on case attribute perception, Journal of Shanxi University(Natural Science Edition), 44, 3, pp. 445-453, (2021)
  • [5] CAI Z X, SUN J W., News text summarization model integrating pointer network, Journal of Chinese Computer Systems, 42, 3, pp. 462-466, (2021)
  • [6] HUANG H N, CHEN Z M, XU C, Et al., Automatic summarization model of aerospace news based on domain concept graph, Journal of Beijing University of Aeronautics and Astronautics, 50, 1, pp. 317-327, (2024)
  • [7] WU H R, GUO W, DENG Y, Et al., Review of semantic analysis techniques of agricultural texts, Transactions of the Chinese Society for Agricultural Machinery, 53, 5, pp. 1-16, (2022)
  • [8] LI J P, ZHANG C, CHEN X J, Et al., Survey on automatic text summarization, Journal of Computer Research and Development, 58, 1, pp. 1-21, (2021)
  • [9] ZHU Y Q, ZHAO P, ZHAO F F, Et al., Survey on abstractive text summarization technologies based on deep learning, Computer Engineering, 47, 11, pp. 11-21, (2021)
  • [10] CHO K, VAN MERRIENBOER B, GULCEHRE C, Et al., Learning phrase representations using RNN encoder-decoder for statistical machinetranslation