Extractive Document Summarization Using a Supervised Learning Approach

被引:0
作者
Charitha, Sangaraju [1 ]
Chittaragi, Nagaratna B. [1 ]
Koolagudi, Shashidhar G. [1 ]
机构
[1] NITK, Dept CSE, Surathkal 575025, India
来源
PROCEEDINGS OF 2018 IEEE DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER) | 2018年
关键词
Text summarization; Convolutional Neural Networks (CNN); Word Embedding; Integer Linear Programming (ILP);
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a model for extractive multi-document text summarization using a supervised learning approach. The model uses a convolutional neural networks (CNN) which is capable of learning sentence features on its own for sentence ranking. This approach has been used in order to avoid the overhead of extracting features from sentences manually. Integer linear programming (ILP) approach has been adopted for selecting sentences to generate the summary based on sentence ranks. This ILP model minimizes the redundancy in the generated summary. We have evaluated our proposed approach on the DUC 2007 dataset and its performance is found to be competitive or better in comparison with state-of-the-art systems.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 26 条
[1]  
[Anonymous], J APPL MATH, DOI DOI 10.7763/1J1E12013.V3
[2]  
[Anonymous], 2014, C EMPIRICAL METHODS
[3]  
Cao ZQ, 2015, AAAI CONF ARTIF INTE, P2153
[4]  
Dalal V., 2013, EM TRENDS ENG TECHN
[5]  
Edmundson H. P., 1969, J ACM JACM, V16
[6]   LexRank: Graph-based lexical centrality as salience in text summarization [J].
Erkan, G ;
Radev, DR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479
[7]  
Galanis D., 2012, P COLING 2012
[8]  
Gillick D., 2009, P WORKSH INT LIN PRO, P1018
[9]  
Glorot X., 2010, ARTIF INTELL
[10]  
Gong Y., 2001, P 24 ANN INT ACM SIG, P1925