HIERARCHICAL TEXT CLASSIFICATION USING CNNS WITH LOCAL APPROACHES

被引:3
|
作者
Krendzelak, Milan [1 ]
Jakab, Frantisek [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Comp & Informat, Letna 9, Kosice 04001, Slovakia
关键词
Hierarchical text classification; convolutional neural network; local top-down approach;
D O I
10.31577/cai_2020_5_907
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we discuss the application of convolutional neural networks (CNNs) for hierarchical text classification using local top-down approaches. We present experimental results implementing a local classification per node approach, a local classification per parent node approach, and a local classification per level approach. A 20 Newsgroup hierarchical training dataset with more than 20 categories and three hierarchical levels was used to train the models. The experiments involved several variations of hyperparameters settings such as batch size, embedding size, and number of available examples from the training dataset, including two variation of CNN model text embedding such as static (stat) and random (rand). The results demonstrated that our proposed use of CNNs outperformed flat CNN baseline model and both the flat and hierarchical support vector machine (SVM) and logistic regression (LR) baseline models. In particular, hierarchical text classification with CNN-stat models using local per parent node and local per level approaches achieved compelling results and outperformed the former and latter state-of-the-art models. However, using CNN with local per node approach for hierarchical text classification underperformed and achieved worse results. Furthermore, we performed a detailed comparison between the proposed hierarchical local approaches with CNNs. The results indicated that the hierarchical local classification per level approach using the CNN model with static text embedding achieved the best results, surpassing the flat SVM and LR baseline models by 7 % and 13 %, surpassing the flat CNN baseline by 5 %, and surpassing the h-SVM and h-LR models by 5 % and 10 %, respectively.
引用
收藏
页码:907 / 924
页数:18
相关论文
共 50 条
  • [31] Exploring deep learning approaches for Urdu text classification in product manufacturing
    Akhter, Muhammad Pervez
    Jiangbin, Zheng
    Naqvi, Irfan Raza
    Abdelmajeed, Mohammed
    Fayyaz, Muhammad
    ENTERPRISE INFORMATION SYSTEMS, 2022, 16 (02) : 223 - 248
  • [32] Text Classification for a Large-Scale Taxonomy Using Dynamically Mixed Local and Global Models for a Node
    Oh, Heung-Seon
    Choi, Yoonjung
    Myaeng, Sung-Hyon
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 7 - 18
  • [33] Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
    Feng, Zijian
    Mao, Kezhi
    Zhou, Hanzhang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [34] Two-channel hierarchical attention mechanism model for short text classification
    Chang, Guanghui
    Hu, Shiyang
    Huang, Haihui
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06): : 6991 - 7013
  • [35] HeteroHTC: Enhancing Hierarchical Text Classification via Heterogeneity Encoding of Label Hierarchy
    Song, Junru
    Chen, Tianlei
    Yang, Yang
    Wang, Feifei
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271
  • [36] GACaps-HTC: graph attention capsule network for hierarchical text classification
    Jinhyun Bang
    Jonghun Park
    Jonghyuk Park
    Applied Intelligence, 2023, 53 : 20577 - 20594
  • [37] GACaps-HTC: graph attention capsule network for hierarchical text classification
    Bang, Jinhyun
    Park, Jonghun
    Park, Jonghyuk
    APPLIED INTELLIGENCE, 2023, 53 (17) : 20577 - 20594
  • [38] A Flower Classification Framework Based on Ensemble of CNNs
    Huang, Buzhen
    Hu, Youpeng
    Sun, Yaoqi
    Hao, Xinhong
    Yan, Chenggang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 235 - 244
  • [39] IMU-Based Fitness Activity Recognition Using CNNs for Time Series Classification
    Mueller, Philipp Niklas
    Mueller, Alexander Josef
    Achenbach, Philipp
    Goebel, Stefan
    SENSORS, 2024, 24 (03)
  • [40] Classification of brain tumor types through MRIs using parallel CNNs and firefly optimization
    Li, Chen
    Zhang, Faxue
    Du, Yongjian
    Li, Huachao
    SCIENTIFIC REPORTS, 2024, 14 (01):