Evaluation measures for hierarchical classification: a unified view and novel approaches

Cited by: 108
Authors
Kosmopoulos, Aris [1, 2]
Partalas, Ioannis [3]
Gaussier, Eric [3]
Paliouras, Georgios [1]
Androutsopoulos, Ion [2]
Affiliations
[1] Natl Ctr Sci Res Demokritos, Athens, Greece
[2] Athens Univ Econ & Business, Athens, Greece
[3] Univ Grenoble 1, Lab Informat Grenoble, Grenoble, France
Keywords
Evaluation; Evaluation measures; Hierarchical classification; Tree-structured class hierarchies; DAG-structured class hierarchies
DOI
10.1007/s10618-014-0382-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hierarchical classification addresses the problem of classifying items into a hierarchy of classes. An important issue in hierarchical classification is the evaluation of different classification algorithms, an issue which is complicated by the hierarchical relations among the classes. Several evaluation measures have been proposed for hierarchical classification, using the hierarchy in different ways, without, however, providing a unified view of the problem. This paper studies the problem of evaluation in hierarchical classification by analysing and abstracting the key components of the existing performance measures. It also proposes two alternative generic views of hierarchical evaluation and introduces two corresponding novel measures. The proposed measures, along with the state-of-the-art ones, are empirically tested on three large datasets from the domain of text classification. The empirical results illustrate the undesirable behaviour of existing approaches and how the proposed methods overcome most of these problems across a range of cases.
Pages: 820-865
Number of pages: 46