Tagging and Labelling Portuguese Modal Verbs

Cited by: 0
Authors
Quaresma, Paulo [1, 4]
Mendes, Amalia [2]
Hendrickx, Iris [2, 3]
Goncalves, Teresa [1]
Affiliations
[1] Univ Evora, Dept Informat, Evora, Portugal
[2] Univ Lisbon, Ctr Linguist, P-1699 Lisbon, Portugal
[3] Radboud Univ Nijmegen, Ctr Language Studies, NL-6525 ED Nijmegen, Netherlands
[4] INESC ID, Spoken Language Syst Lab L2F, Evora, Portugal
Source
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE | 2014 / Vol. 8775
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We present an experiment in automatically tagging a set of Portuguese modal verbs with modal information. Modality is the expression of the speaker's (or the subject's) attitude towards the content of a sentence; it may be marked by lexical clues such as verbs, adverbs, and adjectives, but also by mood and tense. Here we focus exclusively on 9 verbal clues that are frequent in Portuguese and that may have more than one modal meaning. Our gold data set is a corpus of 160,000 tokens, manually annotated according to a modality annotation scheme for Portuguese. We apply a machine learning approach to predict the modal meaning of a verb in context. This modality tagger takes into consideration all the features available from the parsed data (POS, syntactic and semantic). The results show that the tagger improved on the baseline for all verbs and reached macro-average F-measures between 35% and 81%, depending on the modal verb and on the modal value.
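The macro-average F-measure reported in the abstract can be illustrated with a minimal sketch. It assumes the F-measure is F1 (equal weight on precision and recall), which the abstract does not state. The function name `macro_f1`, the label names, and the toy data are hypothetical and are not taken from the paper or its corpus.

```python
from collections import Counter


def macro_f1(gold, predicted):
    """Return the unweighted mean of the per-label F1 scores."""
    labels = sorted(set(gold) | set(predicted))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # p was predicted but is wrong
            fn[g] += 1  # g was the true label and was missed
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # A label that is never predicted correctly scores 0.
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical modal values for six occurrences of one verb (not corpus data).
gold = ["epistemic", "deontic", "deontic", "epistemic", "deontic", "deontic"]
majority_baseline = ["deontic"] * len(gold)
tagger_output = ["epistemic", "deontic", "deontic", "deontic", "deontic", "deontic"]

print(round(macro_f1(gold, majority_baseline), 3))  # 0.4
print(round(macro_f1(gold, tagger_output), 3))      # 0.778
```

Because every modal value counts equally in the macro average, a majority-class baseline is penalised for never predicting the rarer value, even when its raw accuracy looks acceptable.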
Pages: 70-81
Page count: 12