Evaluation of a Sentence Ranker for Text Summarization Based on Roget's Thesaurus

被引:0
作者
Kennedy, Alistair [1 ]
Szpakowicz, Stan [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON, Canada
来源
TEXT, SPEECH AND DIALOGUE | 2010年 / 6231卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evaluation is one of the hardest tasks in automatic text summarization. It is perhaps even harder to determine how much a particular component of a summarization system contributes to the success of the whole system. We examine how to evaluate the sentence ranking component using a corpus which has been partially labelled with Summary Content Units. To demonstrate this technique, we apply it to the evaluation of a new sentence-ranking system which uses Roget's Thesaurus. This corpus provides a quick and nearly automatic method of evaluating the quality of sentence ranking.
引用
收藏
页码:101 / 108
页数:8
相关论文
共 14 条
  • [11] Nastase V., 2006, P TEXTGRAPHS 1 WORKS, P29
  • [12] Nenkova A, 2004, HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P145
  • [13] Centroid-based summarization of multiple documents
    Radev, DR
    Jing, HY
    Stys, M
    Tam, D
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (06) : 919 - 938
  • [14] Zhu M., 2004, 09 U WAT DEP STAT AC