Novel top-down methods for Hierarchical Text Classification

被引:3
作者
Cao Ying [1 ]
Duan run-ying [1 ]
机构
[1] Jiangxi Univ Sci & Technol, Modern Educ Technol & Informat Ctr, Ganzhou 341000, Jiangxi, Peoples R China
来源
INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING 2011 | 2011年 / 24卷
关键词
hierarchical classification; virtual category; top-down approach;
D O I
10.1016/j.proeng.2011.11.2651
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To classify large-scale text corpora, one common approach is using hierarchical text classification and classifying text documents in a top-down manner. Classification methods using top-down approach can scale well and cope with changes to the category trees. However, all these methods suffer from a common problem: a high level of misclassification document has unrecoverable. We define an virtual subclass for each non-leaf category to help the rejected documents go back to ancestor category, thus improving the overall performance. Our experiments using Support Vector Machine (SVM) classifiers on the 20newsgroup collection have shown that they all could reduce blocking and improve the classification accuracy. Our experiments have also shown that the virtual category method delivered the best performance. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of ICAE2011.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 50 条
  • [41] Virtual Faculty Development Using Top-down Implementation Strategy and Adapted EES Model
    Drlik, Martin
    Skalka, Jan
    WORLD CONFERENCE ON EDUCATIONAL TECHNOLOGY RESEARCHES-2011, 2011, 28
  • [42] Bayesian network models for hierarchical text classification from a thesaurus
    de Campos, Luis M.
    Romero, Alfonso E.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (07) : 932 - 944
  • [43] Semantic prosody of Slovene adverb-verb collocations: introducing the top-down approach
    Jurko, Primoz
    CORPORA, 2022, 17 (01) : 39 - 67
  • [44] Comparing Hierarchical Approaches to Enhance Supervised Emotive Text Classification
    Williams, Lowri
    Anthi, Eirini
    Burnap, Pete
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (04)
  • [45] Top-down fabrication of biodegradable multilayer tunicate cellulose films with controlled mechanical properties
    Huang, Da
    Li, Dong
    Mo, Kangwei
    Xu, Rui
    Huang, Yanan
    Cui, Yande
    Zhang, Qunchao
    Chang, Chunyu
    CELLULOSE, 2021, 28 (16) : 10415 - 10424
  • [46] Regionalizing a Tourism Satellite Account: A top-down approach based on existing data sources
    Frent, Cristi
    ARGUMENTA OECONOMICA, 2023, 50 (01): : 205 - 226
  • [47] Hybrid embedding-based text representation for hierarchical multi-label text classification
    Ma, Yinglong
    Liu, Xiaofeng
    Zhao, Lijiao
    Liang, Yue
    Zhang, Peng
    Jin, Beihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [48] Top-down fabrication of horizontally-aligned gallium nitride nanowire arrays for sensor development
    Liu, Guannan
    Wen, Baomei
    Xie, Ting
    Castillo, Audie
    Ha, Jong-Yong
    Sullivan, Nichole
    Debnath, Ratan
    Davydov, Albert
    Peckerar, Martin
    Motayed, Abhishek
    MICROELECTRONIC ENGINEERING, 2015, 142 : 58 - 63
  • [49] Estimating the impact of rainfall seasonality on mean annual water balance using a top-down approach
    Hickel, Klaus
    Zhang, Lu
    JOURNAL OF HYDROLOGY, 2006, 331 (3-4) : 409 - 424
  • [50] Direct Top-Down Fabrication of Large-Area Graphene Arrays by an In Situ Etching Method
    Geng, Dechao
    Wang, Huaping
    Wan, Yu
    Xu, Zhiping
    Luo, Birong
    Xu, Jie
    Yu, Gui
    ADVANCED MATERIALS, 2015, 27 (28) : 4195 - 4199