An Ensemble Framework for Text Classification

被引:0
|
作者
Kamateri, Eleni [1 ]
Salampasis, Michail [1 ]
机构
[1] Int Hellen Univ, Dept Informat & Elect Engn, Alexander Campus,POB 141, Thessaloniki 57400, Greece
关键词
ensemble learning; ensemble framework; text classification; patent classification; NEURAL-NETWORKS;
D O I
10.3390/info16020085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensemble learning can improve predictive performance compared to the performance of any of its constituents alone, while keeping computational demands manageable. However, no reference methodology is available for developing ensemble systems. In this paper, we adapt an ensemble framework for patent classification to assist data scientists in creating flexible ensemble architectures for text classification by selecting a finite set of constituent base models from the many available alternatives. We analyze the axes along which someone can select base models of an ensemble system and propose a methodology for combining them. Moreover, we conduct experiments to compare the effectiveness of ensemble systems against base models and state-of-the-art methods on multiple datasets (three patent classification and two text classification datasets), including long and short texts and single- and/or multi-labeled texts. The results verify the generality of our framework and the effectiveness of ensemble systems, especially ensembles of classifiers trained on different data sections/metadata.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Rumor Classification through a Multimodal Fusion Framework and Ensemble Learning
    Azri, Abderrazek
    Favre, Cecile
    Harbi, Nouria
    Darmont, Jerome
    Nous, Camille
    INFORMATION SYSTEMS FRONTIERS, 2022, 25 (5) : 1795 - 1810
  • [42] Rumor Classification through a Multimodal Fusion Framework and Ensemble Learning
    Abderrazek Azri
    Cécile Favre
    Nouria Harbi
    Jérôme Darmont
    Camille Noûs
    Information Systems Frontiers, 2023, 25 : 1795 - 1810
  • [43] Emotional Text Analysis Based on Ensemble Learning of Three Different Classification Algorithms
    Bian, WenShuo
    Wang, ChunZhi
    Ye, ZhiWei
    Yan, Lingyu
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 2, 2019, : 938 - 941
  • [44] A Novel Neural Ensemble Architecture for On-the-fly Classification of Evolving Text Streams
    Ghahramanian, Pouya
    Bakhshi, Sepehr
    Bonab, Hamed
    Can, Fazli
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [45] Novel approach with nature-inspired and ensemble techniques for optimal text classification
    Khurana, Anshu
    Verma, Om Prakash
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 23821 - 23848
  • [46] Novel approach with nature-inspired and ensemble techniques for optimal text classification
    Anshu Khurana
    Om Prakash Verma
    Multimedia Tools and Applications, 2020, 79 : 23821 - 23848
  • [47] Text Classification Based on a Novel Ensemble Multi-Label Learning Method
    Zhang, Tao
    Wu, Jiansheng
    Hu, Haifeng
    2014 2ND INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2014, : 964 - 968
  • [48] Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
    El Hindi, Khalil
    AlSalman, Hussien
    Qasem, Safwan
    Al Ahmadi, Saad
    ENTROPY, 2018, 20 (11)
  • [49] A Methodological Framework for Dictionary and Rule-based Text Classification
    Abel, Jennifer
    Lantow, Birger
    KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 330 - 337
  • [50] A Submodular Optimization Framework for Imbalanced Text Classification With Data Augmentation
    Alemayehu, Eyor
    Fang, Yi
    IEEE ACCESS, 2023, 11 : 41680 - 41696