Experiments with hierarchical text classification

被引:0
|
作者
Granitzer, M [1 ]
Auer, P [1 ]
机构
[1] Know Ctr, Div Knowledge Discovery, A-8010 Graz, Austria
来源
PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING | 2005年
关键词
machine learning; supervised learning; hierarchical text classification; boosting; ranking performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper applies Boosting to hierarchical text classification where the hierarchical structure is given as directed acyclic graph and compares the results to Support Vector Machines. Hierarchical classification is performed top-down and in each node a flat classifier decides if a document should be further propagated or not. As flat classifiers BoosTexter, CentroidBooster and Support Vector Machines are used, were CentroidBooster is an AdaBoost.MH based alternative similar to BoosTexter. Experiments on the Reuters Corpus Volume 1 and the OHSUMED data set show that the F-1-measure increases if the hierarchal structure of a data set is taken into account. Regarding time complexity we show, that depending on the structure of a hierarchy, learning and classification time can be reduced. Besides these hard classification approaches we also investigate the ranking performance of hierarchical classifiers. Ranking, which can be achieved by providing a meaningful score for each classification decision, is important in most practical settings. We investigate an approach based on using a sigmoid function for calculating a meaningful score, where parameter estimation is based on error bounds from computational learning theory.
引用
收藏
页码:177 / 182
页数:6
相关论文
共 50 条
  • [21] Adaptive Hierarchical Text Classification Using ERNIE and Dynamic Threshold Pruning
    Chen, Han
    Zhang, Yangsen
    Jiang, Yuru
    Duan, Ruixue
    IEEE ACCESS, 2024, 12 : 193641 - 193652
  • [22] Hierarchy-Aware and Label Balanced Model for Hierarchical Text Classification
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    Xia, Chenxi
    Tan, Hai
    He, Yanxiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [23] Utilizing global and path information with language modelling for hierarchical text classification
    Oh, Heung-Seon
    Myaeng, Sung-Hyon
    JOURNAL OF INFORMATION SCIENCE, 2014, 40 (02) : 127 - 145
  • [24] A Study on Hierarchical Text Classification as a Seq2seq Task
    Torba, Fatos
    Gravier, Christophe
    Laclau, Charlotte
    Kammoun, Abderrhammen
    Subercaze, Julien
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 287 - 296
  • [25] HTCSI: A Hierarchical Text Classification Method Based on Selection-Inference
    Xu, Yiming
    Feng, Jianzhou
    Gu, Chenghan
    Qin, Haonan
    Xue, Kehan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 307 - 318
  • [26] A Category Hybrid Embedding Based Approach for Power Text Hierarchical Classification
    Chen X.
    Gao P.
    Liang Y.
    Ma Y.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (01): : 77 - 82
  • [27] Hierarchical text classification with multi-label contrastive learning and KNN
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    He, Yueshun
    Tan, Hai
    He, Yanxiang
    NEUROCOMPUTING, 2024, 577
  • [28] NETHIC: A System for Automatic Text Classification using Neural Networks and Hierarchical Taxonomies
    Ciapetti, Andrea
    Di Florio, Rosario
    Lomasto, Luigi
    Miscione, Giuseppe
    Ruggiero, Giulia
    Toti, Daniele
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS), VOL 1, 2019, : 296 - 306
  • [29] Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
    Feng, Zijian
    Mao, Kezhi
    Zhou, Hanzhang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [30] HeteroHTC: Enhancing Hierarchical Text Classification via Heterogeneity Encoding of Label Hierarchy
    Song, Junru
    Chen, Tianlei
    Yang, Yang
    Wang, Feifei
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271