Assessing sentence scoring techniques for extractive text summarization

被引:195
作者
Ferreira, Rafael [1 ]
Cabral, Luciano de Souza [1 ]
Lins, Rafael Dueire [1 ]
Pereira e Silva, Gabriel [1 ]
Freitas, Fred [1 ]
Cavalcanti, George D. C. [1 ]
Lima, Rinaldo [1 ]
Simske, Steven J. [2 ]
Favaro, Luciano [3 ]
机构
[1] Univ Fed Pernambuco, Informat Ctr, Recife, PE, Brazil
[2] Hewlett Packard Labs, Ft Collins, CO 80528 USA
[3] Hewlett Packard Brazil, Barueri, Brazil
关键词
Extractive summarization; Sentence scoring methods; Summarization evaluation;
D O I
10.1016/j.eswa.2013.04.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text summarization is the process of automatically creating a shorter version of one or more text documents. It is an important way of finding relevant information in large text libraries or in the Internet. Essentially, text summarization techniques are classified as Extractive and Abstractive. Extractive techniques perform text summarization by selecting sentences of documents according to some criteria. Abstractive summaries attempt to improve the coherence among sentences by eliminating redundancies and clarifying the contest of sentences. In terms of extractive summarization, sentence scoring is the technique most used for extractive text summarization. This paper describes and performs a quantitative and qualitative assessment of 15 algorithms for sentence scoring available in the literature. Three different datasets (News, Blogs and Article contexts) were evaluated. In addition, directions to improve the sentence extraction results obtained are suggested. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5755 / 5764
页数:10
相关论文
共 36 条
[1]  
Abuobieda A., 2012, 2012 International Conference on Information Retrieval & Knowledge Management (CAMP), P193, DOI 10.1109/InfRKM.2012.6204980
[2]  
[Anonymous], 2006, THESIS
[3]  
[Anonymous], 2008, P 31 ANN INT ACM SIG, DOI DOI 10.1145/1390334.1390385
[4]  
Baeza-Yates R, 1999, MODERN INFORM RETRIE, V463
[5]  
Balahur A, 2009, P WORKSH EV EM TEXT, P23
[6]  
Barrera Araly, 2012, Computational Linguistics and Intelligent Text Processing. 13th International Conference (CICLing 2012). Proceedings, Part II, P366, DOI 10.1007/978-3-642-28601-8_31
[7]  
Barzilay R., 1997, Intelligent Scalable Text Summarization. Proceedings of a Workshop, P10
[8]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[9]  
2-9
[10]  
Dongmei Zhang, 2012, 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, P1309, DOI 10.1109/FSKD.2012.6233871