Segmentation evaluation metrics, a comparison grounded on prosodic and discourse units

被引：0

作者：

Peshkov, Klim ^{[1
]}

Prevot, Laurent

机构：

[1] Aix Marseille Univ, Aix En Provence, France

来源：

LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2014年

关键词：

evaluation; segmentation; discourse; prosody;

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Knowledge on evaluation metrics and best practices of using them have improved fast in the recent years Fort et al. (2012). However, the advances concern mostly evaluation of classification related tasks. Segmentation tasks have received less attention. Nevertheless, there are crucial in a large number of linguistic studies. A range of metrics is available (F-score on boundaries, F-score on units, WindowDiff ((WD), Boundary Similarity (BS) but it is still relatively difficult to interpret these metrics on various linguistic segmentation tasks, such as prosodic and discourse segmentation. In this paper, we consider real segmented datasets (introduced in Peshkov et al. (2012)) as references which we deteriorate in different ways (random addition of boundaries, random removal boundaries, near-miss errors introduction). This provide us with various measures on controlled datasets and with an interesting benchmark for various linguistic segmentation tasks.

引用

页数：5

共 10 条

[1]

[Anonymous], P WORKSH COMP NAT LA

[2]

Carroll L., 2010, HUMAN LANGUAGE TECHN, P993

[3] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].

COHEN, J .

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46

[4]

Fort K., 2012, P 8 INT C LANG RES E

[5]

Fournier C., 2012, Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, P152

[6]

Fournier C., 2013, P 51 ANN M ASS COMP, V5

[7]

Mathet Y., 2012, P INT C COMP LING CO, P809

[8]

Peshkov K., 2012, P SEMIAL 2012 SEINDI, P181

[9]

Peshkov K., 2013, P PROS DISC INT 2013

[10] A critique and improvement of an evaluation metric for text segmentation [J].

Pevzner, L ;

Hearst, MA .

COMPUTATIONAL LINGUISTICS, 2002, 28 (01) :19-36

← 1 →