A Determinantal Point Process Based Novel Sampling Method of Abstractive Text Summarization

被引:0
作者
Shen, Jianbin [1 ]
Xuan, Junyu [2 ]
Liang, Christy [1 ]
机构
[1] Univ Technol Sydney, Visualisat Inst, Ultimo, NSW, Australia
[2] Univ Technol Sydney, Australian Artificial Intelligence Inst AAII, Ultimo, NSW, Australia
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
基金
澳大利亚研究理事会;
关键词
Abstractive text summarization; determinantal point process;
D O I
10.1109/IJCNN54540.2023.10191958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years abstractive text summarization (ATS) research has made considerable progress attributed to two key improvements, deep neural modeling and likelihood estimation based sampling, in the end-to-end optimization training. While modeling has grounded on a few de facto highly capable base models within encoder-decoder architecture, novel sampling ideas, such as random masking classification and generative prediction by unsupervised learning, have also been explored. They aim at improving prior knowledge, particularly of language modeling for downstream tasks. It has led to the notable performance gain of ATS. But several challenges remain, for example, undesirable word repeats. In this paper, we propose a determinantal point process (DPP) based novel sampling method to address the issue. It can be easily integrated with the existing ATS models. Our experiments and subsequent analysis have revealed that the adopted models trained by our sampling method reduce undesirable word repeats and improve word coverage while achieving competitive ROUGE scores.
引用
收藏
页数:8
相关论文
共 36 条
  • [1] Aghajanyan Armen, 2020, ARXIV200803156CSSTAT
  • [2] Eynard-Mehta theorem, schur process, and their pfaffian analogs
    Borodin, A
    Rains, EM
    [J]. JOURNAL OF STATISTICAL PHYSICS, 2005, 121 (3-4) : 291 - 317
  • [3] From Word to Sense Embeddings: A Survey on Vector Representations of Meaning
    Camacho-Collados, Jose
    Pilehvar, Mohammad Taher
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 63 : 743 - 788
  • [4] Bound energies and states of the Yukawa potential through matrix representation
    Cho, Soyeon
    Yoon, Jin-Hee
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS E, 2019, 28 (11):
  • [5] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [6] Ethayarajh K, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P55
  • [7] Fernandes Patrick, 2021, ARXIV181101824CSSTAT
  • [8] Graff D., 2003, Linguistic Data Consortium, V4, P34
  • [9] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [10] Gulcehre Caglar, 2016, ARXIV160308148CS