Particle swarm optimization model for Hindi text summarization

被引:0
作者
Jain, Rekha [1 ]
Raja, Linesh [1 ]
Sharma, Sandeep Kumar [2 ]
Bhatt, Devershi Pallavi [1 ]
机构
[1] Manipal Univ Jaipur, Dept Comp Applicat, Jaipur 303007, Rajasthan, India
[2] Manipal Univ Jaipur, Dept Comp & Commun Engn, Jaipur 303007, Rajasthan, India
关键词
PSO model; Text summarization; Extractive text summarization; Ranking based techniques; Optimized text summarization; Hindi; TF-IDF;
D O I
10.47974/JIOS-1609
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Text Summarization is one of the techniques that shorten the original text without vanishing its information as well as meaning. A lot of algorithms exist for text summarization. Two approaches namely Abstractive Text Summarization and Extractive Text Summarization are used for this purpose. In Abstractive text summarization, the entire document is regenerated using a few lines. Whereas in Extractive Text Summarization sentences are filtered based on some ranks assigned to them by a specific algorithm. A lot of work has already been done in languages like English, Chinese etc. In this paper, the authors propose the summarization of Hindi text using the Particle Swarm Optimization model. Initially, the text in the Hindi language is summarized using a ranking-based technique then PSO (Particle Swarm Optimization) is applied to have an optimized summary of the text. One of the ranking-based techniques i.e. TF-IDF is introduced. Implementation of the proposed Systems is initially discussed in five steps- preprocessing, feature extraction, ranks generation, post-processing and optimized summarization using PSO. At the end, results are shown in terms of an optimized summary of text in a specific language. This system can be implemented in any standard language, but Hindi is selected for practical implementation because very few research work is done in the Hindi Language.
引用
收藏
页码:839 / 850
页数:12
相关论文
共 28 条
  • [1] Alaliyat S., 2014, Optimisation of boids swarm model based on genetic algorithm and particle swarm optimisation algorithm
  • [2] [Anonymous], 2022, Indian Language News Text Summarization Dataset
  • [3] Araly B., 2011, Automated extractive single-document summarization: Beating the baselines with a new approach, P268
  • [4] Summarizing scientiflc texts: Experiments with extractive summarizers
    Balage Filho, Pedro Paulo
    Salguelro Pardo, Thiago Alexandre
    Gracas Volpe Nunes, Maria Das
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 520 - 524
  • [5] Begum M., 2020, Candidate sentence selection for extractive text summarization
  • [6] Boudjehem D, 2017, J INFORM OPTIM SCI, V38, P481, DOI 10.1080/02522667.2016.1224467
  • [7] Chauhan Kushal, 2018, Unsupervised Text Summarization using Sentence Embeddings
  • [8] Chawla I, 2014, INT CONF CONTEMP, P376, DOI 10.1109/IC3.2014.6897203
  • [9] A Probabilistic Generative Framework for Extractive Broadcast News Speech Summarization
    Chen, Yi-Ting
    Chen, Berlin
    Wang, Hsin-Min
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 95 - 106
  • [10] Garbade M.J., 2018, A quick introduction to text summarization in machine learning