SUPERVISED MULTI-MODAL TOPIC MODEL FOR IMAGE ANNOTATION

被引:0
|
作者
Tran, Thu Hoai [1 ]
Choi, Seungjin [1 ]
机构
[1] POSTECH, Div IT Convergence Engn, Pohang, South Korea
关键词
Image annotation; latent Dirichlet allocation; topic models;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multi-modal topic models are probabilistic generative models where hidden topics are learned from data of different types. In this paper we present supervised multi-modal latent Dirichlet allocation (smmLDA), where we incorporate class label (global description) into the joint modeling of visual words and caption words (local description), for image annotation task. We derive variational inference algorithm to approximately compute posterior distribution over latent variables. Experiments on a subset of LabelMe dataset demonstrate the useful behavior of our model, compared to existing topic models.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Multi-modal Image Retrieval for Search-based Image Annotation with RF
    Budikova, Petra
    Batko, Michal
    Zezula, Pavel
    2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 52 - 60
  • [22] Semi-supervised image clustering with multi-modal information
    Liang, Jianqing
    Han, Yahong
    Hu, Qinghua
    MULTIMEDIA SYSTEMS, 2016, 22 (02) : 149 - 160
  • [23] Semi-supervised image clustering with multi-modal information
    Jianqing Liang
    Yahong Han
    Qinghua Hu
    Multimedia Systems, 2016, 22 : 149 - 160
  • [24] WEAKLY SUPERVISED POLARIMETRIC SAR IMAGE CLASSIFICATION WITH MULTI-MODAL MARKOV ASPECT MODEL
    Yang, Wen
    Dai, Dengxin
    Wu, Jun
    He, Chu
    100 YEARS ISPRS ADVANCING REMOTE SENSING SCIENCE, PT 2, 2010, 38 : 669 - 673
  • [25] Deep Image Annotation and Classification by Fusing Multi-Modal Semantic Topics
    Chen, YongHeng
    Zhang, Fuquan
    Zuo, WanLi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (01): : 392 - 412
  • [26] Leveraging multi-modal fusion for graph-based image annotation
    Amiri, S. Hamid
    Jamzad, Mansour
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 : 816 - 828
  • [27] Multi-Modal Event Topic Model for Social Event Analysis
    Qian, Shengsheng
    Zhang, Tianzhu
    Xu, Changsheng
    Shao, Jie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (02) : 233 - 246
  • [28] A Multi-modal SPM Model for Image Classification
    Zheng, Peng
    Zhao, Zhong-Qiu
    Gao, Jun
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 525 - 535
  • [29] Heterogeneous Image Features Integration via Multi-Modal Semi-Supervised Learning Model
    Cai, Xiao
    Nie, Feiping
    Cai, Weidong
    Huang, Heng
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1737 - 1744
  • [30] Multi-Modal Curriculum Learning for Semi-Supervised Image Classification
    Gong, Chen
    Tao, Dacheng
    Maybank, Stephen J.
    Liu, Wei
    Kang, Guoliang
    Yang, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3249 - 3260