An Ensemble Framework for Text Classification

被引:0
|
作者
Kamateri, Eleni [1 ]
Salampasis, Michail [1 ]
机构
[1] Int Hellen Univ, Dept Informat & Elect Engn, Alexander Campus,POB 141, Thessaloniki 57400, Greece
关键词
ensemble learning; ensemble framework; text classification; patent classification; NEURAL-NETWORKS;
D O I
10.3390/info16020085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensemble learning can improve predictive performance compared to the performance of any of its constituents alone, while keeping computational demands manageable. However, no reference methodology is available for developing ensemble systems. In this paper, we adapt an ensemble framework for patent classification to assist data scientists in creating flexible ensemble architectures for text classification by selecting a finite set of constituent base models from the many available alternatives. We analyze the axes along which someone can select base models of an ensemble system and propose a methodology for combining them. Moreover, we conduct experiments to compare the effectiveness of ensemble systems against base models and state-of-the-art methods on multiple datasets (three patent classification and two text classification datasets), including long and short texts and single- and/or multi-labeled texts. The results verify the generality of our framework and the effectiveness of ensemble systems, especially ensembles of classifiers trained on different data sections/metadata.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] An Optimized Arabic Multilabel Text Classification Approach Using Genetic Algorithm and Ensemble Learning
    Alzanin, Samah M.
    Gumaei, Abdu
    Haque, Md Azimul
    Muaad, Abdullah Y.
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [22] REDUCING THE EFFECT OF IMBALANCE IN TEXT CLASSIFICATION USING SVD AND GLOVE WITH ENSEMBLE AND DEEP LEARNING
    Hossain, Tajbia
    Mauni, Humaira Zahin
    Rab, Rageebir
    COMPUTING AND INFORMATICS, 2022, 41 (01) : 98 - 115
  • [23] Adaptable Term Weighting Framework for Text Classification
    Huynh, Dat
    Dat Tran
    Ma, Wanli
    Sharma, Dharmendra
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 254 - 265
  • [24] Simplified-Boosting Ensemble Convolutional Network for Text Classification
    Fang Zeng
    Niannian Chen
    Dan Yang
    Zhigang Meng
    Neural Processing Letters, 2022, 54 : 4971 - 4986
  • [25] A Hybrid AIS-SVM Ensemble Approach for Text Classification
    Antunes, Mario
    Silva, Catarina
    Ribeiro, Bernardete
    Correia, Manuel
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT II, 2011, 6594 : 342 - +
  • [26] A Combination of Resampling and Ensemble Method for Text Classification on Imbalanced Data
    Feng, Haijun
    Qin, Wen
    Wang, Huijing
    Li, Yi
    Hu, Guangwu
    BIG DATA, BIGDATA 2021, 2022, 12988 : 3 - 16
  • [27] Simplified-Boosting Ensemble Convolutional Network for Text Classification
    Zeng, Fang
    Chen, Niannian
    Yang, Dan
    Meng, Zhigang
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 4971 - 4986
  • [28] Hybrid supervised clustering based ensemble scheme for text classification
    Onan, Aytug
    KYBERNETES, 2017, 46 (02) : 330 - 348
  • [29] Text classification framework for short text based on TFIDF-FastText
    Shrutika Chawla
    Ravreet Kaur
    Preeti Aggarwal
    Multimedia Tools and Applications, 2023, 82 : 40167 - 40180
  • [30] Text classification framework for short text based on TFIDF-FastText
    Chawla, Shrutika
    Kaur, Ravreet
    Aggarwal, Preeti
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40167 - 40180