Random Indexing and Modified Random Indexing based approach for extractive text summarization

被引:8
|
作者
Chatterjee, Niladri [1 ]
Sahoo, Pramod Kumar [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India
[2] Def Res & Dev Org, Inst Syst Studies & Anal, Delhi 110054, India
来源
COMPUTER SPEECH AND LANGUAGE | 2015年 / 29卷 / 01期
关键词
Word Space Model; Random Indexing; PageRank; Convolution; Modified Random Indexing; INFORMATION;
D O I
10.1016/j.csl.2014.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Indexing based extractive text summarization has already been proposed in literature. This paper looks at the above technique in detail, and proposes several improvements. The improvements are both in terms of formation of index (word) vectors of the document, and construction of context vectors by using convolution instead of addition operation on the index vectors. Experiments have been conducted using both angular and linear distances as metrics for proximity. As a consequence, three improved versions of the algorithm, viz. RISUM, RISUM+ and MRISUM were obtained. These algorithms have been applied on DUC 2002 documents, and their comparative performance has been studied. Different ROUGE metrics have been used for performance evaluation. While RISUM and RISUM+ perform almost at par, MRISUM is found to outperform both RISUM and RISUM+ significantly. MRISUM also outperforms LSA+TRM based summarization approach. The study reveals that all the three Random Indexing based techniques proposed in this study produce consistent results when linear distance is used for measuring proximity. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 44
页数:13
相关论文
共 50 条
  • [31] Evaluation of Technology Term Recognition with Random Indexing
    Zadeh, Behrang Q.
    Handschuh, Siegfried
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4027 - 4032
  • [32] A Novel Approach for Semantic Extractive Text Summarization
    Waseemullah
    Fatima, Zainab
    Zardari, Shehnila
    Fahim, Muhammad
    Andleeb Siddiqui, Maria
    Ibrahim, Ag. Asri Ag.
    Nisar, Kashif
    Naz, Laviza Falak
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [33] An Improvised Extractive Approach to Hindi Text Summarization
    Kumar, K. Vimal
    Yadav, Divakar
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, 2015, 339 : 291 - 300
  • [34] A Framework for Extractive Text Summarization using Semantic Graph Based Approach
    Ullah, Shofi
    Al Islam, A. B. M. Alim
    2019 6TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS 2019), 2019, : 48 - 55
  • [35] SumCR: A new subtopic-based extractive approach for text summarization
    Mei, Jian-Ping
    Chen, Lihui
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (03) : 527 - 545
  • [36] A Similarity-Based Abstract Argumentation Approach to Extractive Text Summarization
    Ferilli, Stefano
    Pazienza, Andrea
    Angelastro, Sergio
    Suglia, Alessandro
    AI*IA 2017 ADVANCES IN ARTIFICIAL INTELLIGENCE, 2017, 10640 : 87 - 100
  • [37] SumCR: A new subtopic-based extractive approach for text summarization
    Jian-Ping Mei
    Lihui Chen
    Knowledge and Information Systems, 2012, 31 : 527 - 545
  • [38] An Abstract Argumentation-Based Approach to Automatic Extractive Text Summarization
    Ferilli, Stefano
    Pazienza, Andrea
    DIGITAL LIBRARIES AND MULTIMEDIA ARCHIVES, IRCDL 2018, 2018, 806 : 57 - 68
  • [39] Unsupervised Random Forest Indexing for Fast Action Search
    Yu, Gang
    Yuan, Junsong
    Liu, Zicheng
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 865 - 872
  • [40] Random Indexing Distributional Semantic Models for Croatian Language
    Jankovic, Vedrana
    Snajder, Jan
    Basic, Bojana Dalbelo
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 411 - 418