Query-oriented text summarization based on hypergraph transversals

被引:40
作者
Van Lierde, H. [1 ]
Chow, Tommy W. S. [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon Tong, 83 Tat Chee Av, Hong Kong, Peoples R China
关键词
Query-oriented text summarization; Hypergraph theory; Hypergraph transversal; Sentence clustering; Submodular set functions; ARCHETYPAL ANALYSIS; DOCUMENTS; GRAPH;
D O I
10.1016/j.ipm.2019.03.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rise in the amount of textual resources available on the Internet has created the need for tools of automatic document summarization. The main challenges of query-oriented extractive summarization are (1) to identify the topics of the documents and (2) to recover query-relevant sentences of the documents that together cover these topics. Existing graph- or hypergraph-based summarizers use graph-based ranking algorithms to produce individual scores of relevance for the sentences. Hence, these systems fail to measure the topics jointly covered by the sentences forming the summary, which tends to produce redundant summaries. To address the issue of selecting non-redundant sentences jointly covering the main query-relevant topics of a corpus, we propose a new method using the powerful theory of hypergraph transversals. First, we introduce a new topic model based on the semantic clustering of terms in order to discover the topics present in a corpus. Second, these topics are modeled as the hyperedges of a hypergraph in which the nodes are the sentences. A summary is then produced by generating a transversal of nodes in the hypergraph. Algorithms based on the theory of submodular functions are proposed to generate the transversals and to build the summaries. The proposed summarizer outperforms existing graph- or hypergraph-based summarizers by at least 6% of ROUGE-SU4 F-measure on DUC 2007 dataset. It is moreover cheaper than existing hypergraph-based summarizers in terms of computational time complexity.
引用
收藏
页码:1317 / 1338
页数:22
相关论文
共 22 条
[11]   Graph-based abstractive biomedical text summarization [J].
Givchi, Azadeh ;
Ramezani, Reza ;
Baraani-Dastjerdi, Ahmad .
JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 132
[12]   BiDETS: Binary Differential Evolutionary based Text Summarization [J].
Aljahdali, Hani Moetque ;
Ahmed, Ahmed Hamza Osman ;
Abuobieda, Albaraa .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) :259-271
[13]   Extractive text summarization using clustering-based topic modeling [J].
Belwal, Ramesh Chandra ;
Rai, Sawan ;
Gupta, Atul .
SOFT COMPUTING, 2023, 27 (07) :3965-3982
[14]   Lexical Similarity Based Query-Focused Summarization Using Artificial Immune Systems [J].
Katiyar, Sulabh ;
Borgohain, Samir .
ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 :287-296
[15]   A New LSA and Entropy-Based Approach for Automatic Text Document Summarization [J].
Yadav, Chandra ;
Sharan, Aditi .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2018, 14 (04) :1-32
[16]   A new sentence similarity measure and sentence based extractive technique for automatic text summarization [J].
Aliguliyev, Ramiz M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7764-7772
[17]   Frequent item-set mining and clustering based ranked biomedical text summarization [J].
Gupta, Supriya ;
Sharaff, Aakanksha ;
Nagwani, Naresh Kumar .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (01) :139-159
[18]   Query-based multi-documents summarization using linguistic knowledge and content word expansion [J].
Abdi, Asad ;
Idris, Norisma ;
Alguliyev, Rasim M. ;
Aliguliyev, Ramiz M. .
SOFT COMPUTING, 2017, 21 (07) :1785-1801
[19]   Learn2Sum: A new approach to unsupervised text summarization based on topic modeling [J].
Beldi, Amal ;
Sassi, Salma ;
Jemai, Abedrazzek .
PROCEEDINGS OF 2022 14TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS, MEDES 2022, 2022, :136-143
[20]   Query Focused Multi-document Summarization Based on Five-Layered Graph and Universal Paraphrastic Embeddings [J].
Canhasi, Ercan .
ARTIFICIAL INTELLIGENCE TRENDS IN INTELLIGENT SYSTEMS, CSOC2017, VOL 1, 2017, 573 :220-228