Multi-modal kernel ridge regression for social image classification

被引:13
|
作者
Zhang, Xiaoming [1 ]
Chao, Wenhan [2 ]
Li, Zhoujun [3 ]
Liu, Chunyang [4 ]
Li, Rui [4 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, SKLSDE, Beijing 100191, Peoples R China
[3] Beihang Univ, Beijing Key Lab Network Technol, Beijing, Peoples R China
[4] Natl Comp Network Emergency Response Tech Team Co, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Image classification; Multi-modal learning; Kernel ridge regression; Feature fusion;
D O I
10.1016/j.asoc.2018.02.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is growing interest in social image classification because of its importance in web-based image application. Though there are many approaches on image classification, it is still a great problem to integrate multi-modal contents of social images simultaneously for classification, since the textual content and visual content are represented in two heterogeneous feature spaces. In this study, a multi-modal learning algorithm is proposed to fuse the multiple features through their correlation seamlessly. Specifically, two classification modules based on the kernel ridge regression (KRR) are learned for the two types of features, and they are integrated via a joint model. With the joint model, the classification based on visual features can be reinforced by the classification based on textual features, and vice verse. Then, an efficient optimization method is proposed to resolving the object function. The query image can be classified based on both of the textual features and visual features by combing the results of the two classifiers. Two methods are proposed to combine the classification results to obtain the final result. To evaluate the approach, extensive experiments are conducted on the real-world datasets, and the result demonstrates the superiority of our approach. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 50 条
  • [21] Multi-modal mask Transformer network for social event classification
    Chen H.
    Qian S.
    Li Z.
    Fang Q.
    Xu C.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02): : 579 - 587
  • [22] Multi-manifold Sparse Graph Embedding for Multi-modal Image Classification
    Li, Jingjing
    Wu, Yue
    Zhao, Jidong
    Lu, Ke
    NEUROCOMPUTING, 2016, 173 : 501 - 510
  • [23] Hierarchical Multi-Modal Prompting Transformer for Multi-Modal Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6376 - 6390
  • [24] Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU
    Wanchaitanawong, Napat
    Tanaka, Masayuki
    Shibata, Takashi
    Okutomi, Masatoshi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [25] Multi-Modal Curriculum Learning for Semi-Supervised Image Classification
    Gong, Chen
    Tao, Dacheng
    Maybank, Stephen J.
    Liu, Wei
    Kang, Guoliang
    Yang, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3249 - 3260
  • [26] Deep Image Annotation and Classification by Fusing Multi-Modal Semantic Topics
    Chen, YongHeng
    Zhang, Fuquan
    Zuo, WanLi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (01): : 392 - 412
  • [27] Joint Multi-Modal Longitudinal Regression and Classification for Alzheimer's Disease Prediction
    Brand, Lodewijk
    Nichols, Kai
    Wang, Hua
    Shen, Li
    Huang, Heng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (06) : 1845 - 1855
  • [28] Hyperspectral Image Classification via Spectral-Spatial Shared Kernel Ridge Regression
    Zhao, Chunhui
    Liu, Wu
    Xu, Yan
    Wen, Jinhuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (12) : 1874 - 1878
  • [29] MULTI-MODAL APPROACH TO INDEXING AND CLASSIFICATION
    SWIFT, DF
    WINN, VA
    BRAMER, DA
    INTERNATIONAL CLASSIFICATION, 1977, 4 (02): : 90 - 94
  • [30] Multi-modal Semantic Place Classification
    Pronobis, A.
    Mozos, O. Martinez
    Caputo, B.
    Jensfelt, P.
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (2-3): : 298 - 320