Disentangling Specificity for Abstractive Multi-document Summarization

Cited by: 0

Authors:

Ma, Congbo [1]

Zhang, Wei Emma [2]

Wang, Hu [2]

Zhuang, Haojie [2]

Guo, Mingyu [2]

Affiliations:

[1] Macquarie Univ, Sydney, NSW, Australia

[2] Univ Adelaide, Adelaide, SA, Australia

Source:

2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024

Funding:

Australian Research Council;

Keywords:

Multi-document summarization; Deep neural network; Transformer;

DOI:

10.1109/IJCNN60899.2024.10651001

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multi-document summarization (MDS) generates a summary from a document set. Each document in a set describes topic-relevant concepts, while each document also has its own unique content. However, document specificity receives little attention from existing MDS approaches. Neglecting the information specific to each document limits the comprehensiveness of the generated summaries. To solve this problem, we propose to disentangle the specific content of each document in a document set. The document-specific representations, which are encouraged to be distant from each other via a proposed orthogonal constraint, are learned by a specific representation learner. We provide extensive analysis and find that specific information and document set representations contribute distinctive strengths, and that their combination yields a more comprehensive solution for MDS. We also find that the common (i.e., shared) information does not contribute much to overall performance under the MDS setting. Implementation code is available at https://github.com/congboma/DisentangleSum.
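The abstract mentions an orthogonal constraint that pushes document-specific representations apart but does not state its formula. As a rough illustration only, the sketch below penalizes the mean squared cosine similarity between every pair of document vectors. The function name `orthogonality_penalty`, the choice of cosine similarity, and the toy vectors are all assumptions made for this example. They are not taken from the paper or its repository.

```python
import math
from itertools import combinations


def orthogonality_penalty(reps):
    """Mean squared cosine similarity over all pairs of document vectors.

    Hypothetical illustration, not the paper's loss: the value is 0.0 when
    every pair of vectors is orthogonal and 1.0 when all are collinear.
    """
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        # Treat a zero vector as orthogonal to everything.
        return dot / norm if norm > 0 else 0.0

    pairs = list(combinations(reps, 2))
    if not pairs:
        return 0.0
    return sum(cosine(u, v) ** 2 for u, v in pairs) / len(pairs)


# Toy "document-specific" vectors (made up for this example).
orthogonal_reps = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
collinear_reps = [[1.0, 0.0], [1.0, 0.0], [2.0, 0.0]]

print(orthogonality_penalty(orthogonal_reps))  # 0.0: fully disentangled
print(orthogonality_penalty(collinear_reps))   # 1.0: fully overlapping
```

In a trained model, a term of this kind would typically be added to the summarization loss with a weighting coefficient, so that minimizing it drives the per-document representations toward mutual orthogonality.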


Pages: 8
