Multimodal Ensembling for Zero-Shot Image Classification

Author
Hickmon, Javon [1]
Affiliation
[1] Univ Washington, Dept Comp Sci, Seattle, WA 98195 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Artificial intelligence has made significant progress in image classification, an essential task for machine perception to achieve human-level image understanding. Despite recent advances in vision-language fields, multimodal image classification is still challenging, particularly for the following two reasons. First, models with low capacity often suffer from underfitting and thus underperform on fine-grained image classification. Second, it is important to ensure high-quality data with rich cross-modal representations of each class, which is often difficult to generate. Here, we utilize ensemble learning to reduce the impact of these issues on pre-trained models. We aim to create a meta-model that combines the predictions of multiple open-vocabulary multimodal models trained on different data to create more robust and accurate predictions. By utilizing ensemble learning and multimodal machine learning, we will achieve higher prediction accuracies without any additional training or fine-tuning, meaning that this method is completely zero-shot.
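The abstract does not say how the meta-model combines the individual predictions. As an illustration only, the sketch below assumes one common choice, soft voting: each model's class scores are turned into probabilities with a softmax, and the probabilities are averaged (optionally with weights). The function names, the class labels, and the logit values are hypothetical; in practice the scores would come from pre-trained open-vocabulary models used without further training.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(per_model_logits, class_names, weights=None):
    """Soft-voting ensemble: weighted average of per-model class probabilities.

    per_model_logits: one list of class scores per model, all in the same
    class order as class_names. This combination rule is an assumption,
    not necessarily the method used in the paper.
    """
    n_models = len(per_model_logits)
    if weights is None:
        # Default to a uniform average over models.
        weights = [1.0 / n_models] * n_models
    probs = [softmax(logits) for logits in per_model_logits]
    combined = [
        sum(w * p[c] for w, p in zip(weights, probs))
        for c in range(len(class_names))
    ]
    best = max(range(len(class_names)), key=combined.__getitem__)
    return class_names[best], combined


# Hypothetical scores from two zero-shot models for one image.
class_names = ["finch", "sparrow", "wren"]
model_a_logits = [2.0, 1.0, 0.1]
model_b_logits = [0.5, 2.5, 0.3]

label, combined = ensemble_predict([model_a_logits, model_b_logits], class_names)
print(label, [round(p, 3) for p in combined])
```

No parameters are learned here, so the combination step itself stays zero-shot; non-uniform `weights` would have to be chosen without labeled data for that to remain true.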
Pages: 23747-23749
Number of pages: 3