LCBM: A Multi-View Probabilistic Model for Multi-Label Classification

被引:28
作者
Sun, Shiliang [1 ]
Zong, Daoming [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
Probabilistic logic; Task analysis; Prediction algorithms; Support vector machines; Kernel; Training; Semantics; Multi-view learning; multi-label classification; Bernoulli mixture; probabilistic model; variational autoencoder; INFERENCE;
D O I
10.1109/TPAMI.2020.2974203
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification is an important research topic in machine learning, for which exploiting label dependencies is an effective modeling principle. Recently, probabilistic models have shown great potential in discovering dependencies among labels. In this paper, motivated by the recent success of multi-view learning to improve the generalization performance, we propose a novel multi-view probabilistic model named latent conditional Bernoulli mixture (LCBM) for multi-label classification. LCBM is a generative model taking features from different views as inputs, and conditional on the latent subspace shared by the views a Bernoulli mixture model is adopted to build label dependencies. Inside each component of the mixture, the labels have a weak correlation which facilitates computational convenience. The mean field variational inference framework is used to carry out approximate posterior inference in the probabilistic model, where we propose a Gaussian mixture variational autoencoder (GMVAE) for effective posterior approximation. We further develop a scalable stochastic training algorithm for efficiently optimizing the model parameters and variational parameters, and derive an efficient prediction procedure based on greedy search. Experimental results on multiple benchmark datasets show that our approach outperforms other state-of-the-art methods under various metrics.
引用
收藏
页码:2682 / 2696
页数:15
相关论文
共 49 条
  • [21] Liu M, 2015, AAAI CONF ARTIF INTE, P2778
  • [22] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [23] Multiview Vector-Valued Manifold Regularization for Multilabel Image Classification
    Luo, Yong
    Tao, Dacheng
    Xu, Chang
    Xu, Chao
    Liu, Hong
    Wen, Yonggang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (05) : 709 - 722
  • [24] Makadia A, 2008, LECT NOTES COMPUT SC, V5304, P316, DOI 10.1007/978-3-540-88690-7_24
  • [25] Inference for the generalization error
    Nadeau, C
    Bengio, Y
    [J]. MACHINE LEARNING, 2003, 52 (03) : 239 - 281
  • [26] Modeling the shape of the scene: A holistic representation of the spatial envelope
    Oliva, A
    Torralba, A
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 42 (03) : 145 - 175
  • [27] Regional Multi-View Learning for Cardiac Motion Analysis: Application to Identification of Dilated Cardiomyopathy Patients
    Puyol-Anton, Esther
    Ruijsink, Bram
    Gerber, Bernhard
    Amzulescu, Mihaela Silvia
    Langet, Helene
    De Craene, Mathieu
    Schnabel, Julia A.
    Piro, Paolo
    King, Andrew P.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2019, 66 (04) : 956 - 966
  • [28] Quoc L., 2014, INT C MACH LEARN, P1188, DOI DOI 10.1145/2740908.2742760
  • [29] Rakotomamonjy A, 2008, J MACH LEARN RES, V9, P2491
  • [30] Classifier chains for multi-label classification
    Read, Jesse
    Pfahringer, Bernhard
    Holmes, Geoff
    Frank, Eibe
    [J]. MACHINE LEARNING, 2011, 85 (03) : 333 - 359