Experiments with hierarchical text classification

被引:0
|
作者
Granitzer, M [1 ]
Auer, P [1 ]
机构
[1] Know Ctr, Div Knowledge Discovery, A-8010 Graz, Austria
来源
PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING | 2005年
关键词
machine learning; supervised learning; hierarchical text classification; boosting; ranking performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper applies Boosting to hierarchical text classification where the hierarchical structure is given as directed acyclic graph and compares the results to Support Vector Machines. Hierarchical classification is performed top-down and in each node a flat classifier decides if a document should be further propagated or not. As flat classifiers BoosTexter, CentroidBooster and Support Vector Machines are used, were CentroidBooster is an AdaBoost.MH based alternative similar to BoosTexter. Experiments on the Reuters Corpus Volume 1 and the OHSUMED data set show that the F-1-measure increases if the hierarchal structure of a data set is taken into account. Regarding time complexity we show, that depending on the structure of a hierarchy, learning and classification time can be reduced. Besides these hard classification approaches we also investigate the ranking performance of hierarchical classifiers. Ranking, which can be achieved by providing a meaningful score for each classification decision, is important in most practical settings. We investigate an approach based on using a sigmoid function for calculating a meaningful score, where parameter estimation is based on error bounds from computational learning theory.
引用
收藏
页码:177 / 182
页数:6
相关论文
共 50 条
  • [1] Hierarchical Text Classification Incremental Learning
    Song, Shengli
    Qiao, Xiaofei
    Chen, Ping
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 247 - 258
  • [2] Hierarchical text classification methods and their specification
    Sun, AX
    Lim, EP
    Ng, WK
    COOPERATIVE INTERNET COMPUTING, 2003, 729 : 236 - 256
  • [3] Hierarchical LSTM network for text classification
    Keivan Borna
    Reza Ghanbari
    SN Applied Sciences, 2019, 1
  • [4] Hierarchical LSTM network for text classification
    Borna, Keivan
    Ghanbari, Reza
    SN APPLIED SCIENCES, 2019, 1 (09):
  • [5] Intelligent Funds Assistant Exploiting Hierarchical Text Classification Algorithms
    Saraiva, Ines
    Moniz, Daniela
    Almeida, Alexandre
    Sousa, Joao
    Vieira, Susana
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Disentangled feature graph for Hierarchical Text Classification
    Liu, Renyuan
    Zhang, Xuejie
    Wang, Jin
    Zhou, Xiaobing
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
  • [7] Text Classification with Imperfect Hierarchical Structure Knowledge
    Ngo-Ye, Thomas
    Dutt, Abhijit
    AMCIS 2010 PROCEEDINGS, 2010,
  • [8] JumpLiteGCN: A Lightweight Approach to Hierarchical Text Classification
    Liu, Teng
    Liu, Xiangzhi
    Dong, Yunfeng
    Wu, Xiaoming
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT IV, NLPCC 2024, 2025, 15362 : 54 - 66
  • [9] HIERARCHICAL TEXT CLASSIFICATION USING CNNS WITH LOCAL APPROACHES
    Krendzelak, Milan
    Jakab, Frantisek
    COMPUTING AND INFORMATICS, 2020, 39 (05) : 907 - 924
  • [10] Hierarchical Text Classification based on LDA and Domain Ontology
    An, Wei
    Liu, Qihua
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1112 - +