Research on sentence optimum selection algorithm for multi-document summarization

被引:0
作者
Zhang, Shu [1 ]
Zhao, Tie-Jun [1 ]
Yao, Chao [1 ]
Zheng, De-Quan [1 ]
机构
[1] Department of Computer Science and Technology, Harbin Institute of Technology
来源
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology | 2008年 / 30卷 / 12期
关键词
Multi-document summarization; Redundancy information processing; Sentence optimum selection;
D O I
10.3724/sp.j.1146.2007.00876
中图分类号
学科分类号
摘要
Analyzing sentences selection in summarization, an approach based on deleting sentences in a sentences set to obtain summary is proposed, which differs from the traditional method of adding sentences to get the summary. It has two stages, one is the process of obtaining the candidate summary sentences set with direct obtaining algorithm and redundancy-based obtaining algorithm, the other is the process of deleting sentences with sentences optimum algorithm. With DUC 2004 as the test corpus, the ROUGE value of summaries gotten by sentences selection proves the necessity of sentences optimum selection for multi-document summarization. Compared with the redundancy-based sentences selection method, the validity of the approach proposed is proved.
引用
收藏
页码:2921 / 2925
页数:4
相关论文
共 9 条
  • [1] Lin C.Y., Hovy E., From single to multi-document summarization: A prototype system and its evaluation, Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 457-464, (2002)
  • [2] Document Understanding Conferences
  • [3] NII Test Collection for IR Systems
  • [4] Conroy J.M., Schlesinger J.D., Left-brain/right-brain multi-document summarization, Proceedings of the 2004 Document Understanding Conference (DUC 2004), (2004)
  • [5] Lin C.Y., Hovy E., The automated acquisition of topic signatures for text summarization, Proceedings of the 18th International Conference on Computational Linguistics (COLING-2000), pp. 495-501, (2000)
  • [6] Blair-Goldensohn S., Evans D., Et al., Columbia university at DUC 2004, Proceedings of the 2004 Document Understanding Conference (DUC 2004), (2004)
  • [7] Erkan G., Radev G.R., The university of Michigan at DUC 2004, Proceedings of the 2004 Document Understanding Conference (DUC 2004), (2004)
  • [8] Otterbacher J.C., Winkel A.J., Radev G.R., The Michigan single and multi-document summarizer for DUC 2002, Proceedings of the 2002 Document Understanding Conference (DUC 2002), (2002)
  • [9] Lin C.Y., ROUGE: A package for automatic evaluation of summaries, Proceedings of the ACL 2004 Workshop on Text Summarization, pp. 74-81, (2004)