CLUSTERING TECHNIQUES AND DISCRETE PARTICLE SWARM OPTIMIZATION ALGORITHM FOR MULTI-DOCUMENT SUMMARIZATION

被引:31
|
作者
Aliguliyev, Ramiz M. [1 ]
机构
[1] Natl Acad Sci, Inst Informat Technol, Dept 13, AZ-1141 Baku, Azerbaijan
关键词
text mining; sentence clustering; generic multi-document summarization; sentence extractive technique; discrete Particle Swarm Optimization algorithm; TEXT; SENTENCES; LEXRANK;
D O I
10.1111/j.1467-8640.2010.00365.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-document summarization is a process of automatic creation of a compressed version of a given collection of documents that provides useful information to users. In this article we propose a generic multi-document summarization method based on sentence clustering. We introduce five clustering methods, which optimize various aspects of intra-cluster similarity, inter-cluster dissimilarity and their combinations. To solve the clustering problem a modification of discrete particle swarm optimization algorithm has been proposed. The experimental results on open benchmark data sets from DUC2005 and DUC2007 show that our method significantly outperforms the baseline methods for multi-document summarization.
引用
收藏
页码:420 / 448
页数:29
相关论文
共 50 条
  • [1] Extractive Multi-Document Text Summarization by Using Binary Particle Swarm Optimization
    Potnurwar, Archana
    Pimpalshende, Anjusha
    Aote, Shailendra S.
    Bongirwar, Vrusbali
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 32 - 34
  • [2] Binary Particle Swarm Optimization with an improved genetic algorithm to solve multi-document text summarization problem of Hindi documents
    Aote, Shailendra S.
    Pimpalshende, Anjusha
    Potnurwar, Archana
    Lohi, Shantanu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [3] Multi-document summarization based on unsupervised clustering
    Ji, Paul
    INFORMATION RETRIEVAL TECHNOLOLGY, PROCEEDINGS, 2006, 4182 : 560 - 566
  • [4] A Sentence-Clustering Based Algorithm to Extracting Multi-document Summarization
    Chen, Dinglei
    Wang, Wei
    PROCEEDINGS OF 2008 INTERNATIONAL COLLOQUIUM ON ARTIFICIAL INTELLIGENCE IN EDUCATION, 2008, : 93 - 97
  • [5] Multi-document Summarization Based on Sentence Clustering
    Zheng, Hai-Tao
    Gong, Shu-Qin
    Chen, Hao
    Jiang, Yong
    Xia, Shu-Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 429 - 436
  • [6] Multi-Document Summarization Using Sentence Clustering
    Gupta, Virendra Kumar
    Siddiqui, Tanveer J.
    4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012), 2012,
  • [7] Extractive multi-document text summarization using dolphin swarm optimization approach
    Srivastava, Atul Kumar
    Pandey, Dhiraj
    Agarwal, Alok
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) : 11273 - 11290
  • [8] Extractive multi-document text summarization using dolphin swarm optimization approach
    Atul Kumar Srivastava
    Dhiraj Pandey
    Alok Agarwal
    Multimedia Tools and Applications, 2021, 80 : 11273 - 11290
  • [9] Genetic algorithm based multi-document summarization
    Liu, Dexi
    He, Yanxiang
    Ji, Donghong
    Yang, Hua
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1140 - 1144
  • [10] Multi-document summarization using CS-ABC optimization algorithm
    Kumar K.C.
    Nagalla S.
    Kumar, K. Chandra (chandrakumark2381@gmail.com), 1600, European Alliance for Innovation (07):