Recent automatic text summarization techniques: a survey

被引:400
作者
Gambhir, Mahak [1 ]
Gupta, Vishal [1 ]
机构
[1] Panjab Univ, Univ Inst Engn & Technol, Chandigarh, India
关键词
Text summarization; Summarization survey; Text mining; Artificial intelligence; Information retrieval; Natural language processing; SINGLE-DOCUMENT; INFORMATION; FRAMEWORK; SYSTEM; OPTIMIZATION; RETRIEVAL; COHERENCE; KEYWORD; LEXRANK; MODELS;
D O I
10.1007/s10462-016-9475-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As information is available in abundance for every topic on internet, condensing the important information in the form of summary would benefit a number of users. Hence, there is growing interest among the research community for developing new approaches to automatically summarize the text. Automatic text summarization system generates a summary, i.e. short length text that includes all the important information of the document. Since the advent of text summarization in 1950s, researchers have been trying to improve techniques for generating summaries so that machine generated summary matches with the human made summary. Summary can be generated through extractive as well as abstractive methods. Abstractive methods are highly complex as they need extensive natural language processing. Therefore, research community is focusing more on extractive summaries, trying to achieve more coherent and meaningful summaries. During a decade, several extractive approaches have been developed for automatic summary generation that implements a number of machine learning and optimization techniques. This paper presents a comprehensive survey of recent text summarization extractive approaches developed in the last decade. Their needs are identified and their advantages and disadvantages are listed in a comparative manner. A few abstractive and multilingual text summarization approaches are also covered. Summary evaluation is another challenging issue in this research field. Therefore, intrinsic as well as extrinsic both the methods of summary evaluation are described in detail along with text summarization evaluation conferences and workshops. Furthermore, evaluation results of extractive summarization approaches are presented on some shared DUC datasets. Finally this paper concludes with the discussion of useful future directions that can help researchers to identify areas where further research is needed.
引用
收藏
页码:1 / 66
页数:66
相关论文
共 183 条
[1]  
Abuobieda A., 2012, 2012 International Conference on Information Retrieval & Knowledge Management (CAMP), P193, DOI 10.1109/InfRKM.2012.6204980
[2]   Multiple documents summarization based on evolutionary optimization algorithm [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) :1675-1689
[3]   MCMR: Maximum coverage and minimum redundant text summarization model [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Hajirahimova, Makrufa S. ;
Mehdiyev, Chingiz A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) :14514-14522
[4]   A new sentence similarity measure and sentence based extractive technique for automatic text summarization [J].
Aliguliyev, Ramiz M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7764-7772
[5]  
Almeida M.B., 2013, Proceedings of ACL, P196
[6]  
AMIGO E, 2005, P 43 ANN M ASS COMP, P280
[7]   Incorporating Prior Knowledge into a Transductive Ranking Algorithm for Multi-Document Summarization [J].
Amini, Massih-Reza ;
Usunier, Nicolas .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :704-705
[8]  
[Anonymous], PROCEEDINGS OF THE I
[9]  
[Anonymous], COMPUTATIONAL LEARNI
[10]  
[Anonymous], 2006, P HUMAN LANGUAGE TEC