Multi-Level Second-Order Few-Shot Learning

Cited by: 21
Authors
Zhang, Hongguang [1]
Li, Hongdong [2]
Koniusz, Piotr [2, 3]
Affiliations
[1] AMS, Syst Engn Inst, Shanghai 100141, Peoples R China
[2] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
[3] CSIRO, Data61, Acton, ACT 2601, Australia
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Pipelines; Image recognition; Visualization; Feature extraction; Training; Streaming media; Few-shot learning; second-order statistics; image classification; action recognition; FINE-GRAINED IMAGE; COVARIANCE; RETRIEVAL;
DOI
10.1109/TMM.2022.3142955
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition. We leverage so-called power-normalized second-order base learner streams combined with features that express multiple levels of visual abstraction, and we use self-supervised discriminating mechanisms. As Second-order Pooling (SoP) is popular in image recognition, we employ its basic element-wise variant in our pipeline. The goal of multi-level feature design is to extract feature representations at different layer-wise levels of CNN, realizing several levels of visual abstraction to achieve robust few-shot learning. As SoP can handle convolutional feature maps of varying spatial sizes, we also introduce image inputs at multiple spatial scales into MlSo. To exploit the discriminative information from multi-level and multi-scale features, we develop a Feature Matching (FM) module that reweights their respective branches. We also introduce a self-supervised step, which is a discriminator of the spatial level and the scale of abstraction. Our pipeline is trained in an end-to-end manner. With a simple architecture, we demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, Open MIC, fine-grained datasets such as CUB Birds, Stanford Dogs and Cars, and action recognition datasets such as HMDB51, UCF101, and mini-MIT.
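As an illustration of the second-order pooling step the abstract describes, the following is a minimal sketch, not the authors' implementation. It assumes an autocorrelation matrix averaged over spatial locations and a signed element-wise power function as the power normalization; the function name `second_order_pool` and the exponent `gamma=0.5` are hypothetical choices made for this example. It also shows why feature maps of different spatial sizes yield representations of the same size.

```python
import numpy as np


def second_order_pool(feature_map, gamma=0.5):
    """Pool a (C, H, W) feature map into a (C, C) second-order matrix.

    Assumption: power normalization is a signed element-wise power with
    exponent gamma; the paper may use a different normalization function.
    """
    channels = feature_map.shape[0]
    # Flatten the spatial grid: each column is one C-dimensional local descriptor.
    phi = feature_map.reshape(channels, -1)
    num_locations = phi.shape[1]
    # Autocorrelation matrix averaged over all spatial locations.
    m = phi @ phi.T / num_locations
    # Element-wise power normalization that preserves the sign of each entry.
    return np.sign(m) * np.abs(m) ** gamma


rng = np.random.default_rng(0)
small = second_order_pool(rng.standard_normal((8, 5, 5)))
large = second_order_pool(rng.standard_normal((8, 10, 10)))
print(small.shape, large.shape)
```

Because the spatial dimensions are summed out, both inputs map to a matrix whose size depends only on the channel count, which is what allows inputs at several spatial scales to share one downstream comparison module.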
Pages: 2111-2126
Page count: 16