Neural Message Passing for Multi-label Classification

被引:26
作者
Lanchantin, Jack [1 ]
Sekhon, Arshdeep [1 ]
Qi, Yanjun [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II | 2020年 / 11907卷
关键词
D O I
10.1007/978-3-030-46147-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification (MLC) is the task of assigning a set of target labels for a given sample. Modeling the combinatorial label interactions in MLC has been a long-haul challenge. We propose Label Message-Passing (LaMP) Neural Networks to efficiently model the joint prediction of multiple labels. LaMP treats labels as nodes on a label-interaction graph and computes the hidden representation of each label node conditioned on the input using attention-based neural message passing. Attention enables LaMP to assign different importances to neighbor nodes per label, learning how labels interact (implicitly). The proposed models are simple, accurate, interpretable, structure-agnostic, and applicable for predicting dense labels since LaMP is incredibly parallelizable. We validate the benefits of LaMP on seven real-world MLC datasets, covering a broad spectrum of input/output types and outperforming the state-of-the-art results. Notably, LaMP enables intuitive interpretation of how classifying each label depends on the elements of a sample and at the same time rely on its interaction with other labels (We provide our code and datasets at https://github.com/QData/LaMP.).
引用
收藏
页码:138 / 163
页数:26
相关论文
共 59 条
[1]  
[Anonymous], 2011, 22 INT JOINT C ART I, DOI DOI 10.5591/978-1-57735-516-8/IJCAI11-220
[2]  
[Anonymous], 2018, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], 2001, Conditional random fields: Probabilistic models for segmenting and labeling sequence data
[4]  
[Anonymous], 2016, PMLR
[5]  
[Anonymous], 2005, P 14 ACM INT C INFOR, DOI DOI 10.1145/1099554.109959
[6]  
Ba JL., 2016, ARXIV
[7]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[8]  
Battaglia P.W., 2018, arXiv preprint arXiv:1806.01261
[9]  
Battaglia PW, 2016, ADV NEUR IN, V29
[10]  
Bhatia Kush, 2015, Advances in neural information processing systems, V28