GReAT A Model for the Automatic Generation of Text Summaries

被引:1
作者
Gomez Puyana, Claudia [1 ]
Pomares Quimbaya, Alexandra [1 ]
机构
[1] Pontificia Univ Javeriana, Bogota, Colombia
来源
ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1 | 2013年
关键词
Text Mining; Summary Generation; Natural Language Processing;
D O I
10.5220/00044546028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The excessive amount of available narrative texts within diverse domains such as health (e.g. medical records), justice (e.g. laws, declarations), assurance (e.g. declarations), etc. increases the required time for the analysis of information in a decision making process. Different approaches of summary generation of these texts have been proposed to solve this problem. However, some of them do not take into account the sequentiality of the original document, which reduces the quality of the final summary, other ones create overall summaries that do not satisfy the end user who requires a summary that is related to his profile (e.g. different medical specializations require different information) and others do not analyze the potential duplication of information and the noise of natural language on the summary. To cope these problems this paper presents GReAT a model for automatic summarization that relies on natural language processing and text mining techniques to extract the most relevant information from narrative texts focused on the requirements of the end user. GReAT is an extraction based summary generation model which principle is to identify the users relevant information filtering the text by topic and frequency of words, also it reduces the number of phrases of the summary avoiding the duplication of information. Experimental results show that the functionality of GReAT improves the quality of the summary over other existing methods
引用
收藏
页码:280 / 288
页数:9
相关论文
共 29 条
[1]  
Abu-Jbara A., 2011, P 49 ANN M ASS COMP, V1, P500
[2]  
[Anonymous], 2009, P 11 INT C INF INT W
[3]  
[Anonymous], 2006, PROC 15 ACM INT C IN, DOI [10.1145/1183614.1183701, DOI 10.1145/1183614.1183701]
[4]  
[Anonymous], 2009, P HUM LANG TECHN 200, DOI DOI 10.3115/1620754.1620839
[5]  
Arora R., 2008, 08, P91, DOI DOI 10.1145/1390749.1390764
[6]  
Bossard A., 2009, P 12 C EUR CHAPT ASS, P5
[7]  
Brun, 2004, PROFESIONAL INFORM, V13
[8]   A hybrid approach to automatic text summarization [J].
Chang, Te-Min ;
Hsiao, Wen-Feng .
2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, :65-+
[9]  
Dalal M. K., 2011, INT C WORKSH EM TREN, P690
[10]  
Daume H, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P449