Learning with fuzzy hypergraphs: A topical approach to query-oriented text summarization

被引:21
|
作者
Van Lierde, Hadrien [1 ]
Chow, Tommy W. S. [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon Tong, 83 Tat Chee Av, Hong Kong, Peoples R China
关键词
Automatic text summarization; Fuzzy graphs; Probabilistic topic models; Hierarchical Dirichlet process; Personalized PageRank; Submodular set functions; RANKING;
D O I
10.1016/j.ins.2019.05.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing graph-based methods for extractive document summarization represent sentences of a corpus as the nodes of a graph in which edges depict relationships of lexical similarity between sentences. This approach fails to capture semantic similarities between sentences when they express a similar information but have few words in common and are thus lexically dissimilar. To overcome this issue, we propose to extract semantic similarities based on topical representations of sentences. Inspired by the Hierarchical Dirichlet Process, we propose a topic model to infer topic distributions of sentences. As each topic defines a semantic connection among sentences with a certain degree of membership for each sentence, we propose a fuzzy hypergraph model in which nodes are sentences and fuzzy hyperedges are topics. To produce an informative summary, we extract a set of sentences from the corpus by simultaneously maximizing their relevance to a user-defined query, their centrality in the fuzzy hypergraph and their coverage of topics present in the corpus. We formulate an algorithm building on the theory of submodular functions to solve the associated optimization problem. A thorough comparative analysis with other graph-based summarizers demonstrates the superiority of our method in terms of content coverage of the summaries. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:212 / 224
页数:13
相关论文
共 50 条
  • [21] High quality information extraction and query-oriented summarization for automatic query-reply in social network
    Peng, Min
    Gao, Binlong
    Zhu, Jiahui
    Huang, Jiajia
    Yuan, Mengting
    Li, Fei
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 44 : 92 - 101
  • [22] Exploring hypergraph-based semi-supervised ranking for query-oriented summarization
    Wang, Wei
    Li, Sujian
    Li, Jiwei
    Li, Wenjie
    Wei, Furu
    INFORMATION SCIENCES, 2013, 237 : 271 - 286
  • [23] A cluster-sensitive graph model for query-oriented multi-document summarization
    Wei, Furu
    Li, Wenjie
    Lu, Qin
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 446 - +
  • [24] QODM: A Query-Oriented Data Modeling Approach for NoSQL Databases
    Li, Xiang
    Ma, Zhiyi
    Chen, Hongjie
    PROCEEDINGS OF 2014 IEEE WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS (WARTIA), 2014, : 338 - 345
  • [26] A Query-Sensitive Graph-Based Sentence Ranking Algorithm for Query-Oriented Multi-Document Summarization .
    Wei, Furu
    He, Yanxiang
    Li, Wenjie
    Lu, Qin
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 9 - +
  • [27] Automatic Extractive Text Summarization Based on Fuzzy Logic: A Sentence Oriented Approach
    Hannah, M. Esther
    Geetha, T. V.
    Mukherjee, Saswati
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I, 2011, 7076 : 530 - +
  • [28] CLIOQUERY: Interactive Query-oriented Text Analytics for Comprehensive Investigation of Historical News Archives
    Handler, Abram
    Mahyar, Narges
    O'Connor, Brendan
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2022, 12 (03)
  • [29] Query-oriented topical influential users detection for top-k trending topics in twitter
    Gomasta, Sarmistha Sarna
    Dhali, Aditi
    Anwar, Md Musfique
    Sarker, Iqbal H.
    APPLIED INTELLIGENCE, 2022, 52 (12) : 13415 - 13434
  • [30] Beyond Query-Oriented Highlighting: Investigating the Effect of Snippet Text Highlighting in Search User Behavior
    Zhang, Hui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018