HIERARCHICAL TEXT CLASSIFICATION USING CNNS WITH LOCAL APPROACHES

被引:3
|
作者
Krendzelak, Milan [1 ]
Jakab, Frantisek [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Comp & Informat, Letna 9, Kosice 04001, Slovakia
关键词
Hierarchical text classification; convolutional neural network; local top-down approach;
D O I
10.31577/cai_2020_5_907
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we discuss the application of convolutional neural networks (CNNs) for hierarchical text classification using local top-down approaches. We present experimental results implementing a local classification per node approach, a local classification per parent node approach, and a local classification per level approach. A 20 Newsgroup hierarchical training dataset with more than 20 categories and three hierarchical levels was used to train the models. The experiments involved several variations of hyperparameters settings such as batch size, embedding size, and number of available examples from the training dataset, including two variation of CNN model text embedding such as static (stat) and random (rand). The results demonstrated that our proposed use of CNNs outperformed flat CNN baseline model and both the flat and hierarchical support vector machine (SVM) and logistic regression (LR) baseline models. In particular, hierarchical text classification with CNN-stat models using local per parent node and local per level approaches achieved compelling results and outperformed the former and latter state-of-the-art models. However, using CNN with local per node approach for hierarchical text classification underperformed and achieved worse results. Furthermore, we performed a detailed comparison between the proposed hierarchical local approaches with CNNs. The results indicated that the hierarchical local classification per level approach using the CNN model with static text embedding achieved the best results, surpassing the flat SVM and LR baseline models by 7 % and 13 %, surpassing the flat CNN baseline by 5 %, and surpassing the h-SVM and h-LR models by 5 % and 10 %, respectively.
引用
收藏
页码:907 / 924
页数:18
相关论文
共 50 条
  • [21] Non-Informative Frame Classification in Colonoscopy Videos Using CNNs
    Islam, Abm Rezbaul
    Alammari, Ali
    Oh, JungHwan
    Tavanapong, Wallapak
    Wong, Johnny
    de Groen, Piet C.
    PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON BIOMEDICAL IMAGING, SIGNAL PROCESSING (ICBSP 2018), 2018, : 49 - 56
  • [22] HLC: hierarchically-aware label correlation for hierarchical text classification
    Kumar, Ashish
    Toshinwal, Durga
    APPLIED INTELLIGENCE, 2024, 54 (02) : 1602 - 1618
  • [23] HLC: hierarchically-aware label correlation for hierarchical text classification
    Ashish Kumar
    Durga Toshinwal
    Applied Intelligence, 2024, 54 : 1602 - 1618
  • [24] Hierarchy-Aware and Label Balanced Model for Hierarchical Text Classification
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    Xia, Chenxi
    Tan, Hai
    He, Yanxiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [25] Utilizing global and path information with language modelling for hierarchical text classification
    Oh, Heung-Seon
    Myaeng, Sung-Hyon
    JOURNAL OF INFORMATION SCIENCE, 2014, 40 (02) : 127 - 145
  • [26] A Study on Hierarchical Text Classification as a Seq2seq Task
    Torba, Fatos
    Gravier, Christophe
    Laclau, Charlotte
    Kammoun, Abderrhammen
    Subercaze, Julien
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 287 - 296
  • [27] HTCSI: A Hierarchical Text Classification Method Based on Selection-Inference
    Xu, Yiming
    Feng, Jianzhou
    Gu, Chenghan
    Qin, Haonan
    Xue, Kehan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 307 - 318
  • [28] A Category Hybrid Embedding Based Approach for Power Text Hierarchical Classification
    Chen X.
    Gao P.
    Liang Y.
    Ma Y.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (01): : 77 - 82
  • [29] Hierarchical text classification with multi-label contrastive learning and KNN
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    He, Yueshun
    Tan, Hai
    He, Yanxiang
    NEUROCOMPUTING, 2024, 577
  • [30] External methods to address limitations of using global information on the narrow-down approach for hierarchical text classification
    Oh, Heung-Seon
    Jung, Yuchul
    JOURNAL OF INFORMATION SCIENCE, 2014, 40 (05) : 688 - 708