Experiments with hierarchical text classification

被引:0
|
作者
Granitzer, M [1 ]
Auer, P [1 ]
机构
[1] Know Ctr, Div Knowledge Discovery, A-8010 Graz, Austria
来源
PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING | 2005年
关键词
machine learning; supervised learning; hierarchical text classification; boosting; ranking performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper applies Boosting to hierarchical text classification where the hierarchical structure is given as directed acyclic graph and compares the results to Support Vector Machines. Hierarchical classification is performed top-down and in each node a flat classifier decides if a document should be further propagated or not. As flat classifiers BoosTexter, CentroidBooster and Support Vector Machines are used, were CentroidBooster is an AdaBoost.MH based alternative similar to BoosTexter. Experiments on the Reuters Corpus Volume 1 and the OHSUMED data set show that the F-1-measure increases if the hierarchal structure of a data set is taken into account. Regarding time complexity we show, that depending on the structure of a hierarchy, learning and classification time can be reduced. Besides these hard classification approaches we also investigate the ranking performance of hierarchical classifiers. Ranking, which can be achieved by providing a meaningful score for each classification decision, is important in most practical settings. We investigate an approach based on using a sigmoid function for calculating a meaningful score, where parameter estimation is based on error bounds from computational learning theory.
引用
收藏
页码:177 / 182
页数:6
相关论文
共 50 条
  • [41] Feature Selection Algorithm for Hierarchical Text Classification Using Kullback-Leibler Divergence
    Yao Lifang
    Qin Sijun
    Zhu Huan
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 421 - 424
  • [42] Hierarchical gated recurrent neural network with adversarial and virtual adversarial training on text classification
    Poon, Hoon-Keng
    Yap, Wun-She
    Tee, Yee-Kai
    Lee, Wai-Kong
    Goi, Bok-Min
    NEURAL NETWORKS, 2019, 119 : 299 - 312
  • [43] Boosting multi-label hierarchical text categorization
    Esuli, Andrea
    Fagni, Tiziano
    Sebastiani, Fabrizio
    INFORMATION RETRIEVAL, 2008, 11 (04): : 287 - 313
  • [44] Boosting multi-label hierarchical text categorization
    Andrea Esuli
    Tiziano Fagni
    Fabrizio Sebastiani
    Information Retrieval, 2008, 11 : 287 - 313
  • [45] Large-Scale Hierarchical Text Classification with Recursively Regularized Deep Graph-CNN
    Peng, Hao
    Li, Jianxin
    He, Yu
    Liu, Yaopeng
    Bao, Mengjiao
    Wang, Lihong
    Song, Yangqiu
    Yang, Qiang
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1063 - 1072
  • [46] HMATC: Hierarchical multi-label Arabic text classification model using machine learning
    Aljedani, Nawal
    Alotaibi, Reem
    Taileb, Mounira
    EGYPTIAN INFORMATICS JOURNAL, 2021, 22 (03) : 225 - 237
  • [47] Does the Order Matter? A Random Generative Way to Learn Label Hierarchy for Hierarchical Text Classification
    Yan, Jingsong
    Li, Piji
    Chen, Haibin
    Zheng, Junhao
    Ma, Qianli
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 276 - 285
  • [48] Research article classification with text mining method
    Gurbuz, Tugba
    Uluyol, Celebi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (01):
  • [49] Converting numerical classification into text classification
    Macskassy, SA
    Hirsh, H
    Banerjee, A
    Dayanik, AA
    ARTIFICIAL INTELLIGENCE, 2003, 143 (01) : 51 - 77
  • [50] Using hierarchical text classification to investigate the utility of machine learning in automating online analyses of wildlife exploitation
    Hunter, Sara Bronwen
    Mathews, Fiona
    Weeds, Julie
    ECOLOGICAL INFORMATICS, 2023, 75