Manual Typification of Source Texts and Multi-document Summaries Alignments

Cited by: 3
Authors
Camargo, Renata T. [1]
Agostini, Veronica [2]
Di Felippo, Ariani [1]
Pardo, Thiago A. S. [2]
Affiliations
[1] Univ Fed Sao Carlos, BR-13565905 Sao Carlos, SP, Brazil
[2] Univ Sao Paulo, BR-13566590 Sao Carlos, Brazil
Source
CORPUS RESOURCES FOR DESCRIPTIVE AND APPLIED STUDIES. CURRENT CHALLENGES AND FUTURE DIRECTIONS: SELECTED PAPERS FROM THE 5TH INTERNATIONAL CONFERENCE ON CORPUS LINGUISTICS (CILC2013) | 2013 / Vol. 95
Keywords
typification; alignment; summarization; AGREEMENT;
DOI
10.1016/j.sbspro.2013.10.674
Chinese Library Classification (CLC) number
H [Language and Writing];
Discipline classification code
05;
Abstract
Multi-document Summarization (MDS) has been a focus of research in Natural Language Processing (NLP); its aim is to produce automatic summaries from a collection of texts that deal with the same subject (Mani, 2001). Aligning human-written abstracts with their source documents makes explicit the correspondences that exist in such document/abstract pairs and creates a potentially rich data source for building rules and models that support more linguistically motivated MDS methods. In this paper we describe the typification of such alignments in the CSTNews corpus. This work is part of two larger projects, Sucinto and Sustento, and it supports MDS research on Brazilian Portuguese. Specifically, the typification process consisted of assigning labels, which encode formal and content aspects of the alignment, to each alignment between a summary sentence and its corresponding source sentence. To present this work, we outline the alignment and detail the typification process, the results of our work, and some conclusions. (C) 2013 The Authors. Published by Elsevier Ltd.
Pages: 498-506
Page count: 9
Related papers
19 in total
[1]  
Agostini V., 2013, MANUAL ALIGNMENT NEW
[2]  
[Anonymous], 2011, P 3 RST BRAZ M OCT
[3]  
[Anonymous], 1991, 29 ANN M ASS COMPUTA
[4]  
Barzilay R, 2003, PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, P25
[5]  
Carletta J, 1996, COMPUT LINGUIST, V22, P249
[6]  
Caseli H. M., 2003, ALINHAMENTO SENTENCI, P101
[7]  
Clough P., 2001, METER MEASURING TEXT
[8]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[9]   Induction of word and phrase alignments for automatic document summarization [J].
Daume, H ;
Marcu, D .
COMPUTATIONAL LINGUISTICS, 2005, 31 (04) :505-530
[10]  
Daume III H., 2004, EMPIRICAL METHODS NA, P8