The challenging task of summary evaluation: an overview

被引:0
作者
Elena Lloret
Laura Plaza
Ahmet Aker
机构
[1] Universidad de Alicante,
[2] IR & NLP UNED,undefined
[3] University of Duisburg-Essen,undefined
来源
Language Resources and Evaluation | 2018年 / 52卷
关键词
Text summarization; Evaluation; Content evaluation; Readability; Task-based evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
Evaluation is crucial in the research and development of automatic summarization applications, in order to determine the appropriateness of a summary based on different criteria, such as the content it contains, and the way it is presented. To perform an adequate evaluation is of great relevance to ensure that automatic summaries can be useful for the context and/or application they are generated for. To this end, researchers must be aware of the evaluation metrics, approaches, and datasets that are available, in order to decide which of them would be the most suitable to use, or to be able to propose new ones, overcoming the possible limitations that existing methods may present. In this article, a critical and historical analysis of evaluation metrics, methods, and datasets for automatic summarization systems is presented, where the strengths and weaknesses of evaluation efforts are discussed and the major challenges to solve are identified. Therefore, a clear up-to-date overview of the evolution and progress of summarization evaluation is provided, giving the reader useful insights into the past, present and latest trends in the automatic evaluation of summaries.
引用
收藏
页码:101 / 148
页数:47
相关论文
共 69 条
  • [1] Barzilay R(2008)Modeling local coherence: An entity-based approach Computational Linguistics 34 1-34
  • [2] Lapata M(2012)A framework for obtaining structurally complex condensed representations of document sets in the biomedical domain Procesamiento del Lenguaje Natural 49 21-8
  • [3] Berlanga Llavori R(2007)Automatic summary evaluation based on text grammars Journal of Digital Information 8 1-6
  • [4] Ramírez Cruz Y(2011)Natural language processing (almost) from scratch Journal of Machine Learning Research 12 2493-2537
  • [5] Gil García R(1969)New methods in automatic extracting Journal of the ACM (JACM) 16 264-85
  • [6] Branny E(2005)Customization in a unified framework for summarizing medical literature Artificial Intelligence in Medicine 33 179-198
  • [7] Collobert R(2017)Recent automatic text summarization techniques: A survey Artificial Intelligence Review 47 1-66
  • [8] Weston J(2008)Summarization system evaluation revisited: N-gram graphs ACM Transactions on Speech and Language Processing 5 1-39
  • [9] Bottou L(1995)Centering: A framework for modeling the local coherence of discourse Computational Linguistics 21 203-25
  • [10] Karlen M(2013)MeSH indexing based on automatically generated summaries BMC Bioinformatics 14 208-747