An Ensemble Framework for Text Classification

被引:0
|
作者
Kamateri, Eleni [1 ]
Salampasis, Michail [1 ]
机构
[1] Int Hellen Univ, Dept Informat & Elect Engn, Alexander Campus,POB 141, Thessaloniki 57400, Greece
关键词
ensemble learning; ensemble framework; text classification; patent classification; NEURAL-NETWORKS;
D O I
10.3390/info16020085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ensemble learning can improve predictive performance compared to the performance of any of its constituents alone, while keeping computational demands manageable. However, no reference methodology is available for developing ensemble systems. In this paper, we adapt an ensemble framework for patent classification to assist data scientists in creating flexible ensemble architectures for text classification by selecting a finite set of constituent base models from the many available alternatives. We analyze the axes along which someone can select base models of an ensemble system and propose a methodology for combining them. Moreover, we conduct experiments to compare the effectiveness of ensemble systems against base models and state-of-the-art methods on multiple datasets (three patent classification and two text classification datasets), including long and short texts and single- and/or multi-labeled texts. The results verify the generality of our framework and the effectiveness of ensemble systems, especially ensembles of classifiers trained on different data sections/metadata.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] An effective ensemble deep learning framework for text classification
    Mohammed, Ammar
    Kora, Rania
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 8825 - 8837
  • [2] Adaptive Dense Ensemble Model for Text Classification
    Xu, Yuhong
    Yu, Zhiwen
    Cao, Wenming
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 7513 - 7526
  • [3] Ensemble transfer attack targeting text classification systems
    Kwon, Hyun
    Lee, Sanghyun
    COMPUTERS & SECURITY, 2022, 117
  • [4] Ensemble of keyword extraction methods and classifiers in text classification
    Onan, Aytug
    Korukoglu, Serdar
    Bulut, Hasan
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 57 : 232 - 247
  • [5] Text length considered adaptive bagging ensemble learning algorithm for text classification
    Youwei Wang
    Jiangchun Liu
    Lizhou Feng
    Multimedia Tools and Applications, 2023, 82 : 27681 - 27706
  • [6] Text length considered adaptive bagging ensemble learning algorithm for text classification
    Wang, Youwei
    Liu, Jiangchun
    Feng, Lizhou
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 27681 - 27706
  • [7] Text Classification by Relearning and Ensemble Computation
    Ishii, Naohiro
    Yamada, Takahiro
    Bao, Yongguang
    SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, 149 : 217 - +
  • [8] An ensemble framework for patent classification
    Kamateri, Eleni
    Salampasis, Michail
    Diamantaras, Konstantinos
    WORLD PATENT INFORMATION, 2023, 75
  • [9] Ensemble Learning Based Feature Selection with an Application to Text Classification
    Onan, Aytug
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [10] A Scalable Hybrid Ensemble Model for Text Classification
    Singh, Bharat
    Kushwaha, Nidhi
    Vyas, Om Prakash
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 3148 - 3152