Binary Particle Swarm Optimization with an improved genetic algorithm to solve multi-document text summarization problem of Hindi documents

被引:14
作者
Aote, Shailendra S. [1 ]
Pimpalshende, Anjusha [2 ]
Potnurwar, Archana [3 ]
Lohi, Shantanu [4 ]
机构
[1] Shri Ramdeobaba Coll Engn & Management, Nagpur, India
[2] VNR Vignana Jyothi Inst Engn & Technol, Hyderabad, India
[3] Priyadarshini Coll Engn, Nagpur, India
[4] Govt Coll Engn, Amravati, India
关键词
Multi-document; Text summarization; Hindi; Swarm intelligence; Binary PSO; Improved genetic algorithm; PERFORMANCE; MODELS; GA;
D O I
10.1016/j.engappai.2022.105575
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic text summarization plays a vital role in text retrieval. However, very little interest, as well as attention to Hindi text summarization and the wide use of hybridization of an evolutionary and swarm -based method for solving optimization problems, motivates us to develop a technique for achieving better results. We have used a combination of the Title feature, Sentence length, Sentence position, Numerical Data, Thematic word, Proper Noun and Term frequency & Inverse Sentence Frequency to find the results. The proposed BPSO-IGA comprising Binary Particle Swarm Optimization (BPSO) and an improved genetic algorithm finds a well-formatted summary from multiple documents. The method for calculating the score of the sentence is proposed initially. A combination of BPSO and IGA finds the features' optimal scores and then is used to generate the final summary. The proposed method is compared with six well-known techniques, SummaRuNNer, ChunkRank, TGraph, UniRank, FF-SE, and MDS-ACS, on FIRE 2011 Hindi dataset. To strengthen the hypothesis that all seven features are required in the summary generation, the summary is also generated using only ONE and a combination of more than one feature. It is found that the best summary is generated when all seven features are considered. The BPSO-IGA performs best among all the methods for precision, recall, f-measure, cohesion, and non-redundancy. This research aims to design a multi-document extractive text summarizer for Hindi documents using a swarm intelligence and evolutionary algorithm-based hybrid approach.
引用
收藏
页数:11
相关论文
共 71 条
  • [1] Fuzzy evolutionary cellular learning automata model for text summarization
    Abbasi-ghalehtaki, Razieh
    Khotanlou, Hassan
    Esmaeilpour, Mansour
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2016, 30 : 11 - 26
  • [2] Abuobieda A, 2013, 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), P244, DOI 10.1109/ICCEEE.2013.6633941
  • [3] Plant intelligence based metaheuristic optimization algorithms
    Akyol, Sinem
    Alatas, Bilal
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (04) : 417 - 462
  • [4] Al-Saleh Asma Bader, 2018, COLING 2018 27 INT C, P734
  • [5] A Physics Based Novel Approach for Travelling Tournament Problem: Optics Inspired Optimization
    Alatas, B.
    Bingol, H.
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (03): : 373 - 388
  • [6] COMPARATIVE ASSESSMENT OF LIGHT-BASED INTELLIGENT SEARCH AND OPTIMIZATION ALGORITHMS
    Alatas, Bilal
    Bingol, Harun
    [J]. LIGHT & ENGINEERING, 2020, 28 (06): : 51 - 59
  • [7] Alguliev Rasim, 2009, Intelligent Information Management, V1, P128, DOI 10.4236/iim.2009.12019
  • [8] MCMR: Maximum coverage and minimum redundant text summarization model
    Alguliev, Rasim M.
    Aliguliyev, Ramiz M.
    Hajirahimova, Makrufa S.
    Mehdiyev, Chingiz A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 14514 - 14522
  • [9] An unsupervised approach to generating generic summaries of documents
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Isazade, Nijat R.
    [J]. APPLIED SOFT COMPUTING, 2015, 34 : 236 - 250
  • [10] CLUSTERING TECHNIQUES AND DISCRETE PARTICLE SWARM OPTIMIZATION ALGORITHM FOR MULTI-DOCUMENT SUMMARIZATION
    Aliguliyev, Ramiz M.
    [J]. COMPUTATIONAL INTELLIGENCE, 2010, 26 (04) : 420 - 448