Random Indexing and Modified Random Indexing based approach for extractive text summarization

被引:8
|
作者
Chatterjee, Niladri [1 ]
Sahoo, Pramod Kumar [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India
[2] Def Res & Dev Org, Inst Syst Studies & Anal, Delhi 110054, India
来源
COMPUTER SPEECH AND LANGUAGE | 2015年 / 29卷 / 01期
关键词
Word Space Model; Random Indexing; PageRank; Convolution; Modified Random Indexing; INFORMATION;
D O I
10.1016/j.csl.2014.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Indexing based extractive text summarization has already been proposed in literature. This paper looks at the above technique in detail, and proposes several improvements. The improvements are both in terms of formation of index (word) vectors of the document, and construction of context vectors by using convolution instead of addition operation on the index vectors. Experiments have been conducted using both angular and linear distances as metrics for proximity. As a consequence, three improved versions of the algorithm, viz. RISUM, RISUM+ and MRISUM were obtained. These algorithms have been applied on DUC 2002 documents, and their comparative performance has been studied. Different ROUGE metrics have been used for performance evaluation. While RISUM and RISUM+ perform almost at par, MRISUM is found to outperform both RISUM and RISUM+ significantly. MRISUM also outperforms LSA+TRM based summarization approach. The study reveals that all the three Random Indexing based techniques proposed in this study produce consistent results when linear distance is used for measuring proximity. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 44
页数:13
相关论文
共 50 条
  • [41] Extractive text summarization based on selectivity ranking
    University of Rijeka, Department of Informatics, Rijeka, Croatia
    不详
    Int. Conf. INnov. Intell. Syst. Appl., INISTA - Proc., 2021,
  • [42] Automatic Feature Detection and Clustering Using Random Indexing
    Nakouri, Haifa
    Limam, Mohamed
    IMAGE AND SIGNAL PROCESSING, ICISP 2014, 2014, 8509 : 586 - 593
  • [43] Novel chaotic random memory indexing steganography on FPGA
    Ismail, Samar M.
    Ghidan, Abdelrahman M.
    Zaki, Peter W.
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2020, 125
  • [44] Extractive text summarization using deep learning approach
    Yadav A.K.
    Singh A.
    Dhiman M.
    Vineet
    Kaundal R.
    Verma A.
    Yadav D.
    International Journal of Information Technology, 2022, 14 (5) : 2407 - 2415
  • [45] Indexing-Based Classification: An Approach Toward Classifying Text Documents
    Maheshan, M. S.
    Harish, B. S.
    Revanasiddappa, M. B.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 894 - 902
  • [46] A Frequency-Driven Approach for Extractive Text Summarization
    Zadgaonkar, Ashwini, V
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 37 - 43
  • [47] Extractive Arabic Text Summarization Using Modified PageRank Algorithm
    Elbarougy, Reda
    Behery, Gamal
    El Khatib, Akram
    EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (02) : 73 - 81
  • [48] Content-based image retrieval by indexing random subwindows with randomized trees
    Maree, Raphael
    Geurts, Pierre
    Wehenke, Louis
    COMPUTER VISION - ACCV 2007, PT II, PROCEEDINGS, 2007, 4844 : 611 - +
  • [49] Automatic Extractive Text Summarization Based on Fuzzy Logic: A Sentence Oriented Approach
    Hannah, M. Esther
    Geetha, T. V.
    Mukherjee, Saswati
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I, 2011, 7076 : 530 - +
  • [50] A Framework for Extractive Text Summarization Based on Deep Learning Modified Neural Network Classifier
    Muthu, Balaanand
    Sivaparthipan, C. B.
    Kumar, Priyan Malarvizhi
    Kadry, Seifedine Nimer
    Hsu, Ching-Hsien
    Sanjuan, Oscar
    Gonzalez Crespo, Ruben
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (03)