LCBM: A Multi-View Probabilistic Model for Multi-Label Classification

Cited by: 28

Authors:

Sun, Shiliang [1]

Zong, Daoming [1]

Affiliation:

[1] East China Normal Univ, Sch Comp Sci & Technol, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021, Vol. 43, No. 8

Funding:

National Natural Science Foundation of China

Keywords:

Probabilistic logic; Task analysis; Prediction algorithms; Support vector machines; Kernel; Training; Semantics; Multi-view learning; multi-label classification; Bernoulli mixture; probabilistic model; variational autoencoder; INFERENCE;

DOI:

10.1109/TPAMI.2020.2974203

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multi-label classification is an important research topic in machine learning, for which exploiting label dependencies is an effective modeling principle. Recently, probabilistic models have shown great potential in discovering dependencies among labels. In this paper, motivated by the recent success of multi-view learning in improving generalization performance, we propose a novel multi-view probabilistic model named latent conditional Bernoulli mixture (LCBM) for multi-label classification. LCBM is a generative model that takes features from different views as inputs; conditional on the latent subspace shared by the views, a Bernoulli mixture model is adopted to build label dependencies. Inside each component of the mixture, the labels have a weak correlation, which facilitates computational convenience. The mean field variational inference framework is used to carry out approximate posterior inference in the probabilistic model, where we propose a Gaussian mixture variational autoencoder (GMVAE) for effective posterior approximation. We further develop a scalable stochastic training algorithm for efficiently optimizing the model parameters and variational parameters, and derive an efficient prediction procedure based on greedy search. Experimental results on multiple benchmark datasets show that our approach outperforms other state-of-the-art methods under various metrics.
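
As a rough illustration of how a Bernoulli mixture can capture label dependencies, the sketch below evaluates the textbook mixture likelihood p(y) = sum_k w_k prod_l mu_kl^{y_l} (1 - mu_kl)^{1 - y_l}. It is not the LCBM model itself: in the paper the mixture is conditioned on a latent representation shared by the views, whereas here the weights and Bernoulli parameters are fixed arrays, and the function name and all numbers are hypothetical.

```python
import numpy as np


def bernoulli_mixture_prob(y, weights, mu):
    """Probability of a binary label vector under a plain Bernoulli mixture.

    y:       (L,) binary label vector
    weights: (K,) mixture weights summing to 1 (fixed here by assumption;
             the paper conditions the mixture on a shared latent subspace)
    mu:      (K, L) per-component Bernoulli parameters
    """
    y = np.asarray(y, dtype=float)
    weights = np.asarray(weights, dtype=float)
    mu = np.asarray(mu, dtype=float)
    # Within a component the labels are treated as independent, so the
    # component likelihood is a product over labels.
    per_component = np.prod(np.where(y == 1, mu, 1.0 - mu), axis=1)
    # Mixing over components is what couples the labels.
    return float(weights @ per_component)


# Hypothetical two-component, two-label example.
weights = [0.5, 0.5]
mu = [[0.9, 0.9], [0.1, 0.1]]
p_both = bernoulli_mixture_prob([1, 1], weights, mu)
```

With these illustrative numbers each label is active with marginal probability 0.5, so fully independent labels would give a joint probability of 0.25 for both being active; the mixture assigns 0.41 instead, which is the kind of dependence the mixture structure is meant to express.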


Pages: 2682-2696

Page count: 15
