Optimal performance of Binary Relevance CNN in targeted multi-label text classification

被引:9
作者
Yang, Zhen [1 ]
Emmert-Streib, Frank [1 ]
机构
[1] Tampere Univ, Fac Informat Technol & Commun Sci, Predict Soc & Data Analyt Lab, Korkeakoulunkatu 10, Tampere 33720, Finland
关键词
Multi-label classification; Deep learning; Binary Relevance; Natural language processing; Artificial intelligence; ABLATION;
D O I
10.1016/j.knosys.2023.111286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of multi-label text classification (MLTC), Binary Relevance (BR) stands out as one of the most intuitive and frequently employed methodologies. It tackles the MLTC task by breaking it down into multiple binary classification problems. However, BR has faced conceptual criticism due to its omission of label dependency information. To address this limitation, numerous studies have concentrated their efforts on enhancing the incorporation of label dependencies and document features. This resulted in substantial improvements in the performance of MLTC models. While the question of whether models incorporating label dependency information consistently outperform BR models remains unanswered, the prevailing opinion suggests their superiority. In this paper, we present evidence that challenges this widely held belief. Our numerical results across various text datasets demonstrate that an optimized binary relevance convolutional neural network (BR-CNN) can outperform advanced multi-label learning models explicitly designed to leverage label dependency information as well as advanced Binary Relevance (BR) models. Our result underscores the competitiveness of a BR-CNN approach for MLTC and emphasizes the versatility of the BR model family as a customizable option. More fundamentally, our findings contribute to the ongoing discourse surrounding label dependency and provide valuable insights into the efficacy of the binary relevance approach.
引用
收藏
页数:14
相关论文
共 59 条
  • [1] An ablation study on part-based face analysis using a Multi-input Convolutional Neural Network and Semantic Segmentation
    Abate, Andrea F.
    Cimmino, Lucia
    Lorenzo-Navarro, Javier
    [J]. PATTERN RECOGNITION LETTERS, 2023, 173 : 45 - 49
  • [2] Multi-class Alzheimer's disease classification using image and clinical features
    Altaf, Tooba
    Anwar, Syed Muhammad
    Gul, Nadia
    Majeed, Muhammad Nadeem
    Majid, Muhammad
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2018, 43 : 64 - 74
  • [3] [Anonymous], 2001, P 24 ANN INT ACM SIG
  • [4] [Anonymous], BioNLP 2017, DOI DOI 10.18653/V1/W17-2339
  • [5] Hierarchical multi-label prediction of gene function
    Barutcuoglu, Z
    Schapire, RE
    Troyanskaya, OG
    [J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
  • [6] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [7] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [8] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.48550/ARXIV.1406.1078]
  • [9] A tutorial on the cross-entropy method
    De Boer, PT
    Kroese, DP
    Mannor, S
    Rubinstein, RY
    [J]. ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) : 19 - 67
  • [10] Dembczynski K., 2010, P 27 INT C MACH LEAR, P279