Optimal performance of Binary Relevance CNN in targeted multi-label text classification

被引：9

作者：

Yang, Zhen ^{[1
]}

Emmert-Streib, Frank ^{[1
]}

机构：

[1] Tampere Univ, Fac Informat Technol & Commun Sci, Predict Soc & Data Analyt Lab, Korkeakoulunkatu 10, Tampere 33720, Finland

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 284卷

关键词：

Multi-label classification; Deep learning; Binary Relevance; Natural language processing; Artificial intelligence; ABLATION;

D O I：

10.1016/j.knosys.2023.111286

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the context of multi-label text classification (MLTC), Binary Relevance (BR) stands out as one of the most intuitive and frequently employed methodologies. It tackles the MLTC task by breaking it down into multiple binary classification problems. However, BR has faced conceptual criticism due to its omission of label dependency information. To address this limitation, numerous studies have concentrated their efforts on enhancing the incorporation of label dependencies and document features. This resulted in substantial improvements in the performance of MLTC models. While the question of whether models incorporating label dependency information consistently outperform BR models remains unanswered, the prevailing opinion suggests their superiority. In this paper, we present evidence that challenges this widely held belief. Our numerical results across various text datasets demonstrate that an optimized binary relevance convolutional neural network (BR-CNN) can outperform advanced multi-label learning models explicitly designed to leverage label dependency information as well as advanced Binary Relevance (BR) models. Our result underscores the competitiveness of a BR-CNN approach for MLTC and emphasizes the versatility of the BR model family as a customizable option. More fundamentally, our findings contribute to the ongoing discourse surrounding label dependency and provide valuable insights into the efficacy of the binary relevance approach.

引用

页数：14

共 59 条

[1] An ablation study on part-based face analysis using a Multi-input Convolutional Neural Network and Semantic Segmentation
Abate, Andrea F.
Cimmino, Lucia
Lorenzo-Navarro, Javier
[J]. PATTERN RECOGNITION LETTERS, 2023, 173 : 45 - 49
[2] Multi-class Alzheimer's disease classification using image and clinical features
Altaf, Tooba
Anwar, Syed Muhammad
Gul, Nadia
Majeed, Muhammad Nadeem
Majid, Muhammad
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2018, 43 : 64 - 74
[3] [Anonymous], 2001, P 24 ANN INT ACM SIG
[4] [Anonymous], BioNLP 2017, DOI DOI 10.18653/V1/W17-2339
[5] Hierarchical multi-label prediction of gene function
Barutcuoglu, Z
Schapire, RE
Troyanskaya, OG
[J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
[6] Learning multi-label scene classification
Boutell, MR
Luo, JB
Shen, XP
Brown, CM
[J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
[7] LIBSVM: A Library for Support Vector Machines
Chang, Chih-Chung
Lin, Chih-Jen
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.48550/ARXIV.1406.1078]
[9] A tutorial on the cross-entropy method
De Boer, PT
Kroese, DP
Mannor, S
Rubinstein, RY
[J]. ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) : 19 - 67
[10] Dembczynski K., 2010, P 27 INT C MACH LEAR, P279

← 1 2 3 4 5 6 →